AMENDED CLAIMS

[received by the International Bureau on 21 July 1998 (21.07.98);
original claim 2 amended; new claims 49-65 added;
remaining claims unchanged (4 pages)]



1. A transgenic non-human animal having a transgene disrupting or interfering
with expression of growth differentiation factor-8 (GDF-8) chromosomally
integrated into the germ cells of the animal.

2. The transgenic animal of claim 1, wherein the animal is selected from the
group of species consisting of murine, avian, bovine, ovine, piscine,
and porcine.

3. The transgenic animal of claim 1 where the species is avian. 

4. The transgenic animal of claim 1 where the species is bovine. 

5. The transgenic animal of claim 1 where the species is porcine. 

6. The transgenic animal of claim 1 where the species is ovine.

7. The transgenic animal of claim 1 where the species is piscine.

8. The transgenic animal of claim 1, wherein the transgene comprises GDF-8
antisense polynucleotide(s).

9. The transgenic animal of claim 1, wherein the transgene comprises a gene
encoding a dominant negative GDF-8 polypeptide.

10. The transgenic animal of claim 1, wherein the animal is homozygous or
heterozygous for GDF-8 polynucleotide.

11. A chicken or turkey egg produced by the transgenic animal of claim 3.

44. A method for identifying a compound that affects GDF-8 activity or gene
expression comprising:

a) incubating the compound with GDF-8 polypeptide, or with a
recombinant cell expressing GDF-8 under conditions sufficient to allow the
components to interact; and

b) determining the effect of the compound on GDF-8 activity or
expression.

45. The method of claim 44, wherein the effect is inhibition of GDF-8 activity or 
expression. 

46. The method of claim 44, wherein the effect is stimulation of GDF-8 activity or
expression.

47. An isolated polynucleotide encoding a truncated GDF-8 polypeptide wherein
the truncation is a loss of the C-terminal active fragment of GDF-8.

48. The isolated polynucleotide of claim 47, wherein the polynucleotide is as
shown in FIGURE 12a.

49. The transgenic animal of claim 1 where the species is murine.

50. A method for producing a transgenic non-human animal having a phenotype
characterized by expression of a transgene otherwise not naturally occurring,
wherein expression of the transgene disrupts or interferes with growth
differentiation factor-8 (GDF-8) activity, comprising:

a) introducing a transgene in operable linkage with at least one
expression regulatory sequence into a zygote of an animal;

b) transplanting the zygote of a) into a pseudopregnant animal;

c) allowing the zygote to develop to term; and

d) identifying at least one transgenic offspring from c) where
expression of the transgene disrupts or interferes with GDF-8
activity.

51. The method of claim 50, wherein the introduction of the transgene into the
embryo is by introducing an embryonic stem cell containing the transgene into
the embryo.

52. The method of claim 50, wherein the introduction of the transgene into the
embryo is by infecting the embryo with a virus containing the transgene.

52. The method of claim 52, wherein the virus is a retrovirus. 

53. The method of claim 50, wherein the transgene encodes GDF-8 antisense
polynucleotide(s).

54. The method of claim 50, wherein the transgene encodes a dominant negative
protein that disrupts or interferes with GDF-8 activity.

55. The method of claim 50, wherein the transgenic animal is homozygous or 
heterozygous for GDF-8 polynucleotide. 

56. The method of claim 50, wherein disrupting or interfering with GDF-8 activity 
in the transgenic non-human animal produces increased muscle mass as 
compared to a non-transgenic animal of the same species. 

57. The method of claim 50, wherein the animal is selected from the group 
consisting of murine, avian, bovine, ovine, piscine, and porcine. 

58. A method for producing a transgenic non-human animal having a phenotype
characterized by expression of a transgene otherwise not naturally occurring,
wherein expression of the transgene disrupts or interferes with expression of
growth differentiation factor-8 (GDF-8), comprising:

a) introducing a transgene in operable linkage with at least one
expression regulatory sequence into a zygote of an animal;

b) transplanting the zygote of a) into a pseudopregnant animal;

c) allowing the zygote to develop to term; and

d) identifying at least one transgenic offspring from c) where
expression of the transgene disrupts or interferes with expression
of GDF-8.

59. The method of claim 58, wherein the introduction of the transgene into the
embryo is by introducing an embryonic stem cell containing the transgene into
the embryo.

60. The method of claim 58, wherein the introduction of the transgene into the
embryo is by infecting the embryo with a virus containing the transgene.

61. The method of claim 60, wherein the virus is a retrovirus.

62. The method of claim 58, wherein the transgene encodes GDF-8 antisense
polynucleotide(s).

63. The method of claim 58, wherein the transgenic animal is homozygous or
heterozygous for GDF-8 polynucleotide.

64. The method of claim 58, wherein disrupting or interfering with GDF-8 activity
in the transgenic non-human animal produces increased muscle mass as
compared to a non-transgenic animal of the same species.

65. The method of claim 58, wherein the animal is selected from the group
consisting of murine, avian, bovine, ovine, piscine, and porcine.

AMENDED SHEET (ARTICLE 19) 

The invention also provides a method for identifying a compound that
affects GDF-8 activity or gene expression including incubating the compound with
GDF-8 polypeptide, or with a recombinant cell expressing GDF-8 under conditions
sufficient to allow the components to interact and determining the effect of the
compound on GDF-8 activity or expression.

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1a is a Northern blot showing expression of GDF-8 mRNA in adult tissues. The
probe was a partial murine GDF-8 clone.

FIGURE 1b is a Southern blot showing GDF-8 genomic sequences identified in mouse,
rat, human, monkey, rabbit, cow, pig, dog and chicken.

FIGURE 2 shows partial nucleotide and predicted amino acid sequences of murine
GDF-8 (FIGURE 2a; SEQ ID NO: 11 and 12, respectively), human GDF-8 (FIGURE 2b;
SEQ ID NO: 13 and 14, respectively), rat GDF-8 (FIGURE 2c; SEQ ID NO: 24 and 25,
respectively) and chicken GDF-8 (FIGURE 2d; SEQ ID NO: 22 and 23, respectively).
The putative dibasic processing sites in the murine sequence are boxed.

FIGURE 3a shows the alignment of the C-terminal sequences of GDF-8 with other
members of the TGF-β superfamily. The conserved cysteine residues are boxed. Dashes
denote gaps introduced in order to maximize alignment.

FIGURE 3b shows the alignment of the C-terminal sequences of GDF-8 from human,
murine, rat and chicken sequences.

FIGURE 4 shows amino acid homologies among different members of the TGF-β
superfamily. Numbers represent percent amino acid identities between each pair
calculated from the first conserved cysteine to the C-terminus. Boxes represent
homologies among highly-related members within particular subgroups.

FIGURE 5 shows the sequence of GDF-8. Nucleotide and amino acid sequences of
murine (FIGURE 5a and 5b) (GenBank accession number U84005; SEQ ID NO: 11 and
12, respectively) and human (FIGURE 5c and 5d; SEQ ID NO: 13 and 14, respectively)
GDF-8 cDNA clones are shown. Numbers indicate nucleotide position relative to the 5'
end. Consensus N-linked glycosylation signals are shaded. The putative RXXR
proteolytic cleavage sites are boxed.

FIGURE 6 shows a hydropathicity profile of GDF-8. Average hydrophobicity values for
murine (FIGURE 6a) and human (FIGURE 6b) GDF-8 were calculated using the method
of Kyte and Doolittle (J. Mol. Biol., 157:105-132, 1982). Positive numbers indicate
increasing hydrophobicity.

FIGURE 7 shows a comparison of murine and human GDF-8 amino acid sequences. The
predicted murine sequence is shown in the top lines and the predicted human sequence
is shown in the bottom lines. Numbers indicate amino acid position relative to the
N-terminus. Identities between the two sequences are denoted by a vertical line.

FIGURE 8 shows the expression of GDF-8 in bacteria. BL21 (DE3) (pLysS) cells
carrying a pRSET/GDF-8 expression plasmid were induced with
isopropylthio-β-galactoside, and the GDF-8 fusion protein was purified by metal chelate
chromatography. Lanes: total=total cell lysate; soluble=soluble protein fraction;
insoluble=insoluble protein fraction (resuspended in 10 mM Tris pH 8.0, 50 mM sodium
phosphate, 8 M urea, and 10 mM β-mercaptoethanol [buffer B]) loaded onto the column;
pellet=insoluble protein fraction discarded before loading the column;
flowthrough=proteins not bound by the column; washes=washes carried out in buffer B at
the indicated pH's. Positions of molecular weight standards are shown at the right. Arrow
indicates the position of the GDF-8 fusion protein.

FIGURE 9 shows the expression of GDF-8 in mammalian cells. Chinese hamster ovary
cells were transfected with pMSXND/GDF-8 expression plasmids and selected in G418.
Conditioned media from G418-resistant cells (prepared from cells transfected with
constructs in which GDF-8 was cloned in either the antisense or sense orientation) were
concentrated, electrophoresed under reducing conditions, blotted, and probed with
anti-GDF-8 antibodies and [125I]iodoprotein A. Arrow indicates the position of the
processed GDF-8 protein.

FIGURE 10 shows the expression of GDF-8 mRNA. Poly A-selected RNA (5 µg each)
prepared from adult tissues (FIGURE 10a) or placentas and embryos (FIGURE 10b) at
the indicated days of gestation was electrophoresed on formaldehyde gels, blotted, and
probed with full length murine GDF-8.

FIGURE 11 shows chromosomal mapping of human GDF-8. DNA samples prepared
from human/rodent somatic cell hybrid lines were subjected to PCR, electrophoresed on
agarose gels, blotted, and probed. The human chromosome contained in each of the
hybrid cell lines is identified at the top of each of the first 24 lanes (1-22, X, and Y). In
the lanes designated M, CHO, and H, the starting DNA template was total genomic DNA
from mouse, hamster, and human sources, respectively. In the lane marked Bl, no
template DNA was used. Numbers at left indicate the mobilities of DNA standards.

Figure 12a shows a map of the GDF-8 locus (top line) and targeting construct (second
line). The black and stippled boxes represent coding sequences for the pro- and
C-terminal regions, respectively. The white boxes represent 5' and 3' untranslated
sequences. A probe derived from the region downstream of the 3' homology fragment
and upstream of the most distal HindIII site shown hybridizes to an 11.2 kb HindIII
fragment in the GDF-8 gene and a 10.4 kb fragment in a homologously targeted gene.
Abbreviations: H, HindIII; X, Xba I.

Figure 12b shows a Southern blot analysis of offspring derived from a mating of
heterozygous mutant mice. The lanes are as follows: DNA prepared from wild type 129
SV/J mice (lane 1), targeted embryonic stem cells (lane 2), F1 heterozygous mice (lanes
3 and 4), and offspring derived from a mating of these mice (lanes 5-13).

Figure 13 shows the muscle fiber size distribution in mutant and wild type littermates.
Figure 13a shows the smallest cross-sectional fiber widths measured for wild type (n =
1761) and mutant (n = 1052) tibialis cranialis. Figure 13b shows wild type (n = 900) and
mutant (n = 900) gastrocnemius muscles, and fiber sizes were plotted as a percent of total
fiber number. Standard deviations were 9 and 10 µm, respectively, for wild type and
mutant tibialis cranialis and 11 and 9 µm, respectively, for wild type and mutant
gastrocnemius muscles. Legend: o-o, wild type; mutant.

Figure 14a shows the nucleotide and deduced amino acid sequence for baboon GDF-8
(SEQ ID NO: 18 and 19, respectively).

Figure 14b shows the nucleotide and deduced amino acid sequence for bovine GDF-8
(SEQ ID NO: 20 and 21, respectively).

Figure 14c shows the nucleotide and deduced amino acid sequence for chicken GDF-8
(SEQ ID NO: 22 and 23, respectively).

Figure 14d shows the nucleotide and deduced amino acid sequence for rat GDF-8 (SEQ
ID NO: 24 and 25, respectively).

Figure 14e shows the nucleotide and deduced amino acid sequence for turkey GDF-8
(SEQ ID NO: 26 and 27, respectively).

Figure 14f shows the nucleotide and deduced amino acid sequence for porcine GDF-8
(SEQ ID NO: 28 and 29, respectively).

Figure 14g shows the nucleotide and deduced amino acid sequence for ovine GDF-8
(SEQ ID NO: 30 and 31, respectively).

Figures 15a and 15b show an alignment between murine, rat, human, porcine, ovine,
baboon, bovine, chicken, and turkey GDF-8 amino acid sequences (SEQ ID NO: 12, 25,
14, 29, 31, 19, 21, 23 and 27, respectively).

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a growth and differentiation factor, GDF-8, and a
polynucleotide sequence encoding GDF-8. GDF-8 is expressed at highest levels in
muscle and at lower levels in adipose tissue.

The animals contemplated for use in the practice of the subject invention are those
animals generally regarded as useful for the processing of food stuffs, i.e., avian such as
meat bred and egg laying chicken and turkey, ovine such as lamb, bovine such as beef
cattle and milk cows, piscine and porcine. For purposes of the subject invention, these
animals are referred to as "transgenic" when such animal has had a heterologous DNA
sequence, or one or more additional DNA sequences normally endogenous to the animal
(collectively referred to herein as "transgenes") chromosomally integrated into the germ
cells of the animal. The transgenic animal (including its progeny) will also have the
transgene integrated into the chromosomes of somatic cells.

The TGF-β superfamily consists of multifunctional polypeptides that control
proliferation, differentiation, and other functions in many cell types. Many of the
peptides have regulatory, both positive and negative, effects on other peptide growth
factors. The structural homology between the GDF-8 protein of this invention and the
members of the TGF-β family indicates that GDF-8 is a new member of the family of
growth and differentiation factors. Based on the known activities of many of the other
members, it can be expected that GDF-8 will also possess biological activities that will
make it useful as a diagnostic and therapeutic reagent.



In particular, certain members of this superfamily have expression patterns or possess
activities that relate to the function of the nervous system. For example, the inhibins and
activins have been shown to be expressed in the brain (Meunier, et al., Proc. Natl. Acad.
Sci., 85:247, 1988; Sawchenko, et al., Nature, 334:615, 1988), and activin has been
shown to be capable of functioning as a nerve cell survival molecule (Schubert, et al.,
Nature, 344:868, 1990). Another family member, namely, GDF-1, is nervous
system-specific in its expression pattern (Lee, S.J., Proc. Natl. Acad. Sci., USA, 88:4250,
1991), and certain other family members, such as Vgr-1 (Lyons, et al., Proc. Natl. Acad.
Sci., USA, 86:4554, 1989; Jones, et al., Development, 111:531, 1991), OP-1 (Ozkaynak,
et al., J. Biol. Chem., 267:25220, 1992), and BMP-4 (Jones, et al., Development,
111:531, 1991), are also known to be expressed in the nervous system. Because it is
known that skeletal muscle produces a factor or factors that promote the survival of
motor neurons (Brown, Trends Neurosci., 7:10, 1984), the expression of GDF-8 in
muscle suggests that one activity of GDF-8 may be as a trophic factor for neurons. In this
regard, GDF-8 may have applications in the treatment of neurodegenerative diseases,
such as amyotrophic lateral sclerosis or muscular dystrophy, or in maintaining cells or
tissues in culture prior to transplantation.

GDF-8 may also have applications in treating disease processes involving muscle, such
as in musculodegenerative diseases or in tissue repair due to trauma. In this regard, many
other members of the TGF-β family are also important mediators of tissue repair. TGF-β
has been shown to have marked effects on the formation of collagen and to cause a
striking angiogenic response in the newborn mouse (Roberts, et al., Proc. Natl. Acad.
Sci., USA 82:4167, 1986). TGF-β has also been shown to inhibit the differentiation of
myoblasts in culture (Massague, et al., Proc. Natl. Acad. Sci., USA 83:8206, 1986).
Moreover, because myoblast cells may be used as a vehicle for delivering genes to
muscle for gene therapy, the properties of GDF-8 could be exploited for maintaining
cells prior to transplantation or for enhancing the efficiency of the fusion.

The expression of GDF-8 in adipose tissue also raises the possibility of applications for
GDF-8 in the treatment of obesity or of disorders related to abnormal proliferation of
adipocytes. In this regard, TGF-β has been shown to be a potent inhibitor of adipocyte
differentiation in vitro (Ignotz and Massague, Proc. Natl. Acad. Sci., USA 82:8530,
1985).

Polypeptides, Polynucleotides, Vectors and Host Cells 

The invention provides substantially pure GDF-8 polypeptide and isolated
polynucleotides that encode GDF-8. The term "substantially pure" as used herein refers
to GDF-8 which is substantially free of other proteins, lipids, carbohydrates or other
materials with which it is naturally associated. One skilled in the art can purify GDF-8
using standard techniques for protein purification. The substantially pure polypeptide
will yield a single major band on a non-reducing polyacrylamide gel. The purity of the
GDF-8 polypeptide can also be determined by amino-terminal amino acid sequence
analysis. GDF-8 polypeptide includes functional fragments of the polypeptide, as long as
the activity of GDF-8 remains. Smaller peptides containing the biological activity of
GDF-8 are included in the invention.

The invention provides polynucleotides encoding the GDF-8 protein. These
polynucleotides include DNA, cDNA and RNA sequences which encode GDF-8. It is
understood that all polynucleotides encoding all or a portion of GDF-8 are also included
herein, as long as they encode a polypeptide with GDF-8 activity. Such polynucleotides
include naturally occurring, synthetic, and intentionally manipulated polynucleotides.
For example, GDF-8 polynucleotide may be subjected to site-directed mutagenesis. The
polynucleotide sequence for GDF-8 also includes antisense sequences. The
polynucleotides of the invention include sequences that are degenerate as a result of the
genetic code. There are 20 natural amino acids, most of which are specified by more than
one codon. Therefore, all degenerate nucleotide sequences are included in the invention
as long as the amino acid sequence of GDF-8 polypeptide encoded by the nucleotide
sequence is functionally unchanged.
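
The degeneracy described above can be illustrated with a minimal Python sketch. It is not part of the disclosure: it assumes the standard genetic code (written here in the conventional T, C, A, G ordering), and the function names and the short example peptides are hypothetical, chosen only for illustration.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in T, C, A, G order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(amino_acid):
    """Return every codon that specifies the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def count_encodings(peptide):
    """Count the distinct nucleotide sequences that encode the same peptide."""
    return prod(len(synonymous_codons(aa)) for aa in peptide)


def translate(dna):
    """Translate a DNA coding strand codon by codon ('*' marks a stop codon)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(synonymous_codons("R"))   # arginine is specified by six codons
    print(count_encodings("RL"))    # 6 * 6 degenerate encodings of Arg-Leu
    print(translate("CGTCTG"), translate("AGATTA"))  # both encode the same dipeptide
```

Two different nucleotide sequences that translate to the same peptide are, in the sense used above, degenerate variants of one another.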

Specifically disclosed herein is a genomic DNA sequence containing a portion of the
GDF-8 gene. The sequence contains an open reading frame corresponding to the
predicted C-terminal region of the GDF-8 precursor protein. The encoded polypeptide



WO 98/33887    PCT/US98/02479

is predicted to contain two potential proteolytic processing sites (KR and RR). Cleavage
of the precursor at the downstream site would generate a mature biologically active
C-terminal fragment of 109 and 103 amino acids for murine and human species,
respectively, with a predicted molecular weight of approximately 12,400. Also disclosed
are full length murine and human GDF-8 cDNA sequences. The murine pre-pro-GDF-8
protein is 376 amino acids in length, which is encoded by a 2676 base pair nucleotide
sequence, beginning at nucleotide 104 and extending to a TGA stop codon at nucleotide
1232. The human GDF-8 protein is 375 amino acids and is encoded by a 2743 base pair
sequence, with the open reading frame beginning at nucleotide 59 and extending to
nucleotide 1184. GDF-8 is also capable of forming dimers, or heterodimers, with an
expected molecular weight of approximately 23-30 kD (see Example 4). For example,
GDF-8 may form heterodimers with other family members, such as GDF-11.
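
As a quick arithmetic check of the coordinates quoted above, the following Python sketch recomputes the codon counts. It assumes that the second coordinate in each pair is the first base of the stop codon (stated for the murine TGA, assumed here for the human sequence); the function name is hypothetical.

```python
def orf_codons(first_coding_nt, stop_codon_start):
    """Number of sense codons between the first coding nucleotide and the stop codon.

    Both positions are 1-based; stop_codon_start is assumed to be the first
    base of the stop codon, so it is excluded from the coding length.
    """
    length = stop_codon_start - first_coding_nt
    if length <= 0 or length % 3:
        raise ValueError("coordinates do not describe an in-frame open reading frame")
    return length // 3


MONOMER_MW = 12_400  # predicted molecular weight of the C-terminal fragment

if __name__ == "__main__":
    print("murine codons:", orf_codons(104, 1232))   # (1232 - 104) / 3
    print("human codons:", orf_codons(59, 1184))     # (1184 - 59) / 3
    print("homodimer of two fragments:", 2 * MONOMER_MW)
```

Under that assumption the coordinates reproduce the stated lengths of 376 and 375 amino acids, and two 12,400 fragments give 24,800, which lies within the expected 23-30 kD range for a dimer.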

Also provided herein are the biologically active C-terminal fragments of chicken (Figure
2c) and rat (Figure 2d) GDF-8. The full length nucleotide and deduced amino acid
sequences for baboon, bovine, chicken, rat, ovine, porcine, and turkey are shown in
Figures 14a-g and human and murine are shown in Figure 5. As shown in Figure 3b,
alignment of the amino acid sequences of human, murine, rat and chicken GDF-8
indicates that the sequences are 100% identical in the C-terminal biologically active
fragment. Figures 15a and 15b also show the alignment of GDF-8 amino acid sequences
for murine, rat, human, baboon, porcine, ovine, bovine, chicken and turkey. Given the
extensive conservation of amino acid sequences between species, it would now be
routine for one of skill in the art to obtain the GDF-8 nucleic acid and amino acid
sequence for GDF-8 from any species, including those provided herein, as well as
piscine, for example.

The C-terminal region of GDF-8 following the putative proteolytic processing site shows
significant homology to the known members of the TGF-β superfamily. The GDF-8
sequence contains most of the residues that are highly conserved in other family
members and in other species (see FIGURES 3a and 3b and 15a and 15b). Like the
TGF-βs and inhibin βs, GDF-8 contains an extra pair of cysteine residues in addition to
the 7 cysteines found in virtually all other family members. Among the known family
members, GDF-8 is most homologous to Vgr-1 (45% sequence identity) (see FIGURE
4).

Minor modifications of the recombinant GDF-8 primary amino acid sequence may result
in proteins which have substantially equivalent activity as compared to the GDF-8
polypeptide described herein. Such modifications may be deliberate, as by site-directed
mutagenesis, or may be spontaneous. All of the polypeptides produced by these
modifications are included herein as long as the biological activity of GDF-8 still exists.
Further, deletion of one or more amino acids can also result in a modification of the
structure of the resultant molecule without significantly altering its biological activity.
This can lead to the development of a smaller active molecule which would have broader
utility. For example, one can remove amino or carboxy terminal amino acids which are
not required for GDF-8 biological activity.

The nucleotide sequence encoding the GDF-8 polypeptide of the invention includes the
disclosed sequence and conservative variations thereof. The term "conservative
variation" as used herein denotes the replacement of an amino acid residue by another,
biologically similar residue. Examples of conservative variations include the substitution
of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another,
or the substitution of one polar residue for another, such as the substitution of arginine
for lysine, glutamic for aspartic acid, or glutamine for asparagine, and the like. The term
"conservative variation" also includes the use of a substituted amino acid in place of an
unsubstituted parent amino acid provided that antibodies raised to the substituted
polypeptide also immunoreact with the unsubstituted polypeptide.
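
The substitution examples given above can be written down as a small lookup, sketched here in Python. The groups contain only the residues named in the text; since the text ends the list with "and the like", the table is illustrative rather than exhaustive, and the function names and example peptides are hypothetical.

```python
# Residue groups taken from the examples in the text (one-letter codes).
CONSERVATIVE_GROUPS = [
    frozenset("IVLM"),  # hydrophobic: isoleucine, valine, leucine, methionine
    frozenset("RK"),    # arginine / lysine
    frozenset("ED"),    # glutamic acid / aspartic acid
    frozenset("QN"),    # glutamine / asparagine
]


def is_conservative(original, replacement):
    """True if the two different residues fall in the same example group."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, original, replacement, is_conservative) for each difference."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    return [
        (i + 1, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]


if __name__ == "__main__":
    # Hypothetical three-residue peptides, for illustration only.
    for change in classify_substitutions("MKD", "LKK"):
        print(change)
```

A substitution flagged as non-conservative here is not necessarily excluded by the definition above; it simply is not one of the listed examples.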

DNA sequences of the invention can be obtained by several methods. For example, the
DNA can be isolated using hybridization techniques which are well known in the art.
These include, but are not limited to: 1) hybridization of genomic or cDNA libraries with
probes to detect homologous nucleotide sequences, 2) polymerase chain reaction (PCR)
on genomic DNA or cDNA using primers capable of annealing to the DNA sequence of
interest, and 3) antibody screening of expression libraries to detect cloned DNA
fragments with shared structural features.

Preferably the GDF-8 polynucleotide of the invention is derived from a mammalian
organism. GDF-8 polynucleotides from chicken, turkey, fish and other species are also
included herein. Screening procedures which rely on nucleic acid hybridization make it
possible to isolate any gene sequence from any organism, provided the appropriate probe
is available. Given the extensive nucleotide and amino acid homology between species,
it would be routine for one of skill in the art to obtain polynucleotides encoding GDF-8
from any species. Oligonucleotide probes, which correspond to a part of the sequence
encoding the protein in question, can be synthesized chemically. This requires that short,
oligopeptide stretches of amino acid sequence must be known. The DNA sequence
encoding the protein can be deduced from the genetic code; however, the degeneracy of
the code must be taken into account. It is possible to perform a mixed addition reaction
when the sequence is degenerate. This includes a heterogeneous mixture of denatured
double-stranded DNA. For such screening, hybridization is preferably performed on
either single-stranded DNA or denatured double-stranded DNA. Hybridization is
particularly useful in the detection of cDNA clones derived from sources where an
extremely low amount of mRNA sequences relating to the polypeptide of interest are
present. In other words, by using stringent hybridization conditions directed to avoid
non-specific binding, it is possible, for example, to allow the autoradiographic
visualization of a specific cDNA clone by the hybridization of the target DNA to that
single probe in the mixture which is its complete complement (Wallace, et al., Nucl. Acid
Res., 9:879, 1981).

The development of specific DNA sequences encoding GDF-8 can also be obtained by:
1) isolation of double-stranded DNA sequences from the genomic DNA; 2) chemical
manufacture of a DNA sequence to provide the necessary codons for the polypeptide of
interest; and 3) in vitro synthesis of a double-stranded DNA sequence by reverse
transcription of mRNA isolated from a eukaryotic donor cell. In the latter case, a
double-stranded DNA complement of mRNA is eventually formed which is generally
referred to as cDNA.

Of the three above-noted methods for developing specific DNA sequences for use in
recombinant procedures, the isolation of genomic DNA isolates is the least common.
This is especially true when it is desirable to obtain the microbial expression of
mammalian polypeptides due to the presence of introns.

The synthesis of DNA sequences is frequently the method of choice when the entire
sequence of amino acid residues of the desired polypeptide product is known. When the
entire sequence of amino acid residues of the desired polypeptide is not known, the direct
synthesis of DNA sequences is not possible and the method of choice is the synthesis of
cDNA sequences. Among the standard procedures for isolating cDNA sequences of
interest is the formation of plasmid- or phage-carrying cDNA libraries which are derived
from reverse transcription of mRNA which is abundant in donor cells that have a high
level of genetic expression. When used in combination with polymerase chain reaction
technology, even rare expression products can be cloned. In those cases where significant
portions of the amino acid sequence of the polypeptide are known, the production of
labeled single or double-stranded DNA or RNA probe sequences duplicating a sequence
putatively present in the target cDNA may be employed in DNA/DNA hybridization
procedures which are carried out on cloned copies of the cDNA which have been
denatured into a single-stranded form (Jay, et al., Nucl. Acid Res., 11:2325, 1983).

A cDNA expression library, such as lambda gt11, can be screened indirectly for GDF-8
peptides having at least one epitope, using antibodies specific for GDF-8. Such
antibodies can be either polyclonally or monoclonally derived and used to detect
expression product indicative of the presence of GDF-8 cDNA.

In nucleic acid hybridization reactions, the conditions used to achieve a particular level
of stringency will vary, depending on the nature of the nucleic acids being hybridized.
For example, the length, degree of complementarity, nucleotide sequence composition
(e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. DNA) of the hybridizing
regions of the nucleic acids can be considered in selecting hybridization conditions. An
additional consideration is whether one of the nucleic acids is immobilized, for example,
on a filter.
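
To make the dependence on length and GC v. AT content concrete, the following Python sketch applies a commonly used rule of thumb for short oligonucleotide probes, often called the Wallace rule (about 2 °C per A or T and 4 °C per G or C). The rule is not stated in this document and is introduced here purely as an illustrative assumption; the function names and the example probe are hypothetical, and conditions for a real probe would still be determined empirically.

```python
def gc_fraction(probe):
    """Fraction of G and C bases in a DNA probe sequence."""
    probe = probe.upper()
    if not probe or set(probe) - set("ACGT"):
        raise ValueError("probe must be a non-empty string of A, C, G, T")
    return (probe.count("G") + probe.count("C")) / len(probe)


def rule_of_thumb_tm(probe):
    """Approximate melting temperature in degrees C for a short oligonucleotide.

    Uses the 2 * (A + T) + 4 * (G + C) rule of thumb; it is only a rough
    guide for short probes and ignores salt, mismatches, and immobilization.
    """
    probe = probe.upper()
    if not probe or set(probe) - set("ACGT"):
        raise ValueError("probe must be a non-empty string of A, C, G, T")
    at = probe.count("A") + probe.count("T")
    gc = probe.count("G") + probe.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    example_probe = "ATGCATGCATGCATGCAT"  # hypothetical 18-mer, not from the disclosure
    print(round(gc_fraction(example_probe), 2), rule_of_thumb_tm(example_probe))
```

Longer or more GC-rich hybridizing regions give higher estimates, which is consistent with the factors listed above for selecting hybridization conditions.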



An example of progressively higher stringency conditions is as follows: 2 x SSC/0.1%
SDS at about room temperature (hybridization conditions); 0.2 x SSC/0.1% SDS at about
room temperature (low stringency conditions); 0.2 x SSC/0.1% SDS at about 42°C
(moderate stringency conditions); and 0.1 x SSC at about 68°C (high stringency
conditions). Washing can be carried out using only one of these conditions, e.g., high
stringency conditions, or each of the conditions can be used, e.g., for 10-15 minutes each,
in the order listed above, repeating any or all of the steps listed. However, as mentioned
above, optimal conditions will vary, depending on the particular hybridization reaction
involved, and can be determined empirically.
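
The wash series above can be captured as a small ordered table, sketched here in Python. The values are copied from the text (no SDS concentration is given for the high stringency step, so it is recorded as unspecified); the class and function names are hypothetical.

```python
from typing import NamedTuple, Optional


class Wash(NamedTuple):
    label: str
    ssc: float                    # SSC concentration, in multiples of 1 x SSC
    sds_percent: Optional[float]  # None where the text gives no SDS value
    temperature: str


# Ordered from least to most stringent, as listed in the text.
STRINGENCY_SERIES = [
    Wash("hybridization", 2.0, 0.1, "about room temperature"),
    Wash("low stringency", 0.2, 0.1, "about room temperature"),
    Wash("moderate stringency", 0.2, 0.1, "about 42 C"),
    Wash("high stringency", 0.1, None, "about 68 C"),
]


def washes_through(label):
    """Return the steps, in order, up to and including the named condition."""
    labels = [wash.label for wash in STRINGENCY_SERIES]
    return STRINGENCY_SERIES[: labels.index(label) + 1]


if __name__ == "__main__":
    for wash in washes_through("moderate stringency"):
        print(f"{wash.label}: {wash.ssc} x SSC at {wash.temperature}, 10-15 minutes")
```

Stringency rises along the series because the salt concentration falls or the temperature rises at each step, which is the pattern the table makes explicit.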

DNA sequences encoding GDF-8 can be expressed in vitro by DNA transfer into a
suitable host cell. "Host cells" are cells in which a vector can be propagated and its DNA
expressed. The term also includes any progeny of the subject host cell. It is understood
that all progeny may not be identical to the parental cell since there may be mutations
that occur during replication. However, such progeny are included when the term "host
cell" is used. Methods of stable transfer, meaning that the foreign DNA is continuously
maintained in the host, are known in the art.

In the present invention, the GDF-8 polynucleotide sequences may be inserted into a
recombinant expression vector. The term "recombinant expression vector" refers to a
plasmid, virus or other vehicle known in the art that has been manipulated by insertion
or incorporation of the GDF-8 genetic sequences. Such expression vectors contain a
promoter sequence which facilitates the efficient transcription of the inserted genetic
sequence of the host. The expression vector typically contains an origin of replication,
a promoter, as well as specific genes which allow phenotypic selection of the
transformed cells. Vectors suitable for use in the present invention include, but are not
limited to the T7-based expression vector for expression in bacteria (Rosenberg, et al.,
Gene, 56:125, 1987), the pMSXND expression vector for expression in mammalian cells
(Lee and Nathans, J. Biol. Chem., 263:3521, 1988) and baculovirus-derived vectors for
expression in insect cells. The DNA segment can be present in the vector operably linked
to regulatory elements, for example, a promoter (e.g., T7, metallothionein I, or
polyhedrin promoters).

Polynucleotide sequences encoding GDF-8 can be expressed in either prokaryotes or
eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms.
Methods of expressing DNA sequences having eukaryotic or viral sequences in
prokaryotes are well known in the art. Biologically functional viral and plasmid DNA
vectors capable of expression and replication in a host are known in the art. Such vectors
are used to incorporate DNA sequences of the invention. Preferably, the mature
C-terminal region of GDF-8 is expressed from a cDNA clone containing the entire
coding sequence of GDF-8. Alternatively, the C-terminal portion of GDF-8 can be
expressed as a fusion protein with the pro-region of another member of the TGF-β
family or co-expressed with another pro-region (see for example, Hammonds, et al.,
Molec. Endocrin., 5:149, 1991; Gray, A., and Mason, A., Science, 247:1328, 1990).

Transformation of a host cell with recombinant DNA may be carried out by conventional
techniques as are well known to those skilled in the art. Where the host is prokaryotic,
such as E. coli, competent cells which are capable of DNA uptake can be prepared from
cells harvested after exponential growth phase and subsequently treated by the CaCl2
method using procedures well known in the art. Alternatively, MgCl2 or RbCl can be
used. Transformation can also be performed after forming a protoplast of the host cell if
desired.

When the host is a eukaryote, such methods of transfection of DNA as calcium phosphate
co-precipitates, conventional mechanical procedures such as microinjection,
electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be used.
Eukaryotic cells can also be cotransformed with DNA sequences encoding the GDF-8
of the invention, and a second foreign DNA molecule encoding a selectable phenotype,
such as the herpes simplex thymidine kinase gene. Another method is to use a eukaryotic
viral vector, such as simian virus 40 (SV40) or bovine papilloma virus, to transiently
infect or transform eukaryotic cells and express the protein (see, for example, Eukaryotic
Viral Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982).

Isolation and purification of microbial expressed polypeptide, or fragments thereof, 
provided by the invention, may be carried out by conventional means including 
preparative chromatography and immunological separations involving monoclonal or 
polyclonal antibodies. 



GDF-8 Antibodies and Methods of Use

The invention includes antibodies immunoreactive with GDF-8 polypeptide or functional
fragments thereof. Antibody which consists essentially of pooled monoclonal antibodies
with different epitopic specificities, as well as distinct monoclonal antibody preparations,
are provided. Monoclonal antibodies are made from antigen containing fragments of the
protein by methods well known to those skilled in the art (Kohler, et al., Nature,
256:495, 1975). The term antibody as used in this invention is meant to include intact
molecules as well as fragments thereof, such as Fab and F(ab')2, Fv and SCA fragments
which are capable of binding an epitopic determinant on GDF-8.

(1) An Fab fragment consists of a monovalent antigen-binding fragment of an antibody
molecule, and can be produced by digestion of a whole antibody molecule with the
enzyme papain, to yield a fragment consisting of an intact light chain and a portion of a
heavy chain.

(2) An Fab' fragment of an antibody molecule can be obtained by treating a whole
antibody molecule with pepsin, followed by reduction, to yield a molecule consisting of
an intact light chain and a portion of a heavy chain. Two Fab' fragments are obtained per
antibody molecule treated in this manner.

(3) An (Fab')2 fragment of an antibody can be obtained by treating a whole antibody
molecule with the enzyme pepsin, without subsequent reduction. A (Fab')2 fragment is
a dimer of two Fab' fragments, held together by two disulfide bonds.

(4) An Fv fragment is defined as a genetically engineered fragment containing the
variable region of a light chain and the variable region of a heavy chain expressed as two
chains.

(5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
containing the variable region of a light chain and the variable region of a heavy chain, 
linked by a suitable, flexible polypeptide linker. 

As used in this invention, the term "epitope" refers to an antigenic determinant on an
antigen, such as a GDF-8 polypeptide, to which the paratope of an antibody, such as a
GDF-8-specific antibody, binds. Antigenic determinants usually consist of chemically
active surface groupings of molecules, such as amino acids or sugar side chains, and can
have specific three-dimensional structural characteristics, as well as specific charge
characteristics.



As is mentioned above, antigens that can be used in producing GDF-8-specific antibodies
include GDF-8 polypeptides or GDF-8 polypeptide fragments. The polypeptide or
peptide used to immunize an animal can be obtained by standard recombinant, chemical
synthetic, or purification methods. As is well known in the art, in order to increase
immunogenicity, an antigen can be conjugated to a carrier protein. Commonly used
carriers include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin
(BSA), and tetanus toxoid. The coupled peptide is then used to immunize the animal
(e.g., a mouse, a rat, or a rabbit). In addition to such carriers, well known adjuvants can
be administered with the antigen to facilitate induction of a strong immune response.

The term "cell-proliferative disorder" denotes malignant as well as non-malignant cell
populations which often appear to differ from the surrounding tissue both morphologically
and genotypically. Malignant cells [...] develop as a result of a multistep
process. The GDF-8 polynucleotide that is an antisense molecule [...]

One such disorder is a malignant cell proliferative disorder, for example.

The invention provides a method for detecting a cell proliferative disorder of muscle or
adipose tissue which comprises contacting an anti-GDF-8 antibody with a cell suspected
of having a GDF-8 associated disorder and detecting binding to the antibody. [...]
binding to GDF-8. For purposes of the invention, an antibody specific for GDF-8
polypeptide may be used to detect the level of GDF-8 in biological [...]
used. A preferred [...]
with the level in a normal cell to determine whether the subject has a GDF-8-associated
cell proliferative disorder. Such methods of detection are also useful using nucleic acid
hybridization to detect the level of GDF-8 mRNA in a sample or to detect an altered
GDF-8 gene. Preferably the subject is human.

The antibodies of the invention can be used in any subject in which it is desirable to
administer in vitro or in vivo immunodiagnosis or immunotherapy. The antibodies of the
invention are suited for use, for example, in immunoassays in which they can be utilized
in liquid phase or bound to a solid phase carrier. In addition, the antibodies in these
immunoassays can be detectably labeled in various ways. Examples of types of
immunoassays which can utilize antibodies of the invention are competitive and
non-competitive immunoassays in either a direct or indirect format. Examples of such
immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) assay.
Detection of the antigens using the antibodies of the invention can be done utilizing
immunoassays which are run in either the forward, reverse, or simultaneous modes,
including immunohistochemical assays on physiological samples. Those of skill in the
art will know, or can readily discern, other immunoassay formats without undue
experimentation.

The antibodies of the invention can be bound to many different carriers and used to
detect the presence of an antigen comprising the polypeptide of the invention. Examples
of well-known carriers include glass, polystyrene, polypropylene, polyethylene, dextran,
nylon, amylases, natural and modified celluloses, polyacrylamides, agaroses and
magnetite. The nature of the carrier can be either soluble or insoluble for purposes of the
invention. Those skilled in the art will know of other suitable carriers for binding
antibodies, or will be able to ascertain such, using routine experimentation.

There are many different labels and methods of labeling known to those of ordinary skill
in the art. Examples of the types of labels which can be used in the present invention
include enzymes, radioisotopes, fluorescent compounds, colloidal metals,
chemiluminescent compounds, phosphorescent compounds, and bioluminescent
compounds. Those of ordinary skill in the art will know of other suitable labels for
binding to the antibody, or will be able to ascertain such, using routine experimentation.

Another technique which may also result in greater sensitivity consists of coupling the
antibodies to low molecular weight haptens. These haptens can then be specifically
detected by means of a second reaction. For example, it is common to use such haptens
as biotin, which reacts with avidin, or dinitrophenyl, pyridoxal, and fluorescein, which
can react with specific antihapten antibodies.

In using the monoclonal antibodies of the invention for the in vivo detection of antigen,
the detectably labeled antibody is given in a dose which is diagnostically effective. The
term "diagnostically effective" means that the amount of detectably labeled monoclonal
antibody is administered in sufficient quantity to enable detection of the site having the
antigen comprising a polypeptide of the invention for which the monoclonal antibodies
are specific.

The concentration of detectably labeled monoclonal antibody which is administered
should be sufficient such that the binding to those cells having the polypeptide is
detectable compared to the background. Further, it is desirable that the detectably labeled
monoclonal antibody be rapidly cleared from the circulatory system in order to give the
best target-to-background signal ratio.

As a rule, the dosage of detectably labeled monoclonal antibody for in vivo diagnosis will
vary depending on such factors as age, sex, and extent of disease of the individual. Such
dosages may vary, for example, depending on whether multiple injections are given,
antigenic burden, and other factors known to those of skill in the art.

For in vivo diagnostic imaging, the type of detection instrument available is a major
factor in selecting a given radioisotope. The radioisotope chosen must have a type of
decay which is detectable for a given type of instrument. Still another important factor
in selecting a radioisotope for in vivo diagnosis is that deleterious radiation with respect
to the host is minimized. Ideally, a radioisotope used for in vivo imaging will lack a
particle emission, but produce a large number of photons in the 140-250 keV range,
which may readily be detected by conventional gamma cameras.
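
The photon-energy criterion above can be illustrated with a short sketch. The gamma-ray energies listed in the code are approximate, commonly cited principal emission lines supplied here as assumptions; they do not come from this specification, and the function and variable names are hypothetical.

```python
# Energy window quoted in the text for conventional gamma cameras, in keV.
WINDOW_KEV = (140.0, 250.0)

# ASSUMPTION: approximate principal gamma-ray lines (keV), rounded reference
# values supplied for illustration only; not taken from this specification.
GAMMA_LINES_KEV = {
    "In-111": [171.3, 245.4],
    "Ga-67": [93.3, 184.6, 300.2],
    "Tl-201": [135.3, 167.4],
}


def lines_in_window(energies_kev, window=WINDOW_KEV):
    """Return the emission lines that fall inside the detection window (inclusive)."""
    low, high = window
    return [e for e in energies_kev if low <= e <= high]


# Map each isotope to the subset of its lines that lies in the window.
usable = {iso: lines_in_window(lines) for iso, lines in GAMMA_LINES_KEV.items()}

for isotope, lines in usable.items():
    print(isotope, lines)
```

The sketch only screens on photon energy; the other factors named above, such as the absence of particle emission and minimal deleterious radiation to the host, are not modeled.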

For in vivo diagnosis, radioisotopes may be bound to immunoglobulin either directly or
indirectly by using an intermediate functional group. Intermediate functional groups
which often are used to bind radioisotopes which exist as metallic ions to
immunoglobulins are the bifunctional chelating agents such as
diethylenetriaminepentaacetic acid (DTPA) and ethylenediaminetetraacetic acid (EDTA)
and similar molecules. Typical examples of metallic ions which can be bound to the
monoclonal antibodies of the invention are 111In, 97Ru, 67Ga, 68Ga, 72As, 89Zr and 201Tl.

The monoclonal antibodies of the invention can also be labeled with a paramagnetic
isotope for purposes of in vivo diagnosis, as in magnetic resonance imaging (MRI) or
electron spin resonance (ESR). In general, any conventional method for visualizing
diagnostic imaging can be utilized. Usually gamma and positron emitting radioisotopes
are used for camera imaging and paramagnetic isotopes for MRI. Elements which are
particularly useful in such techniques include 157Gd, 55Mn, 162Dy, 52Cr, and 56Fe.

The monoclonal antibodies of the invention can be used in vitro and in vivo to monitor
the course of amelioration of a GDF-8-associated disease in a subject. Thus, for example,
by measuring the increase or decrease in the number of cells expressing antigen
comprising a polypeptide of the invention or changes in the concentration of such antigen
present in various body fluids, it would be possible to determine whether a particular
therapeutic regimen aimed at ameliorating the GDF-8-associated disease is effective. The
term "ameliorate" denotes a lessening of the detrimental effect of the GDF-8-associated
disease in the subject receiving therapy.

Additional Methods of Treatment and Diagnosis 

The present invention identifies a nucleotide sequence that can be expressed in an altered
manner as compared to expression in a normal cell; therefore, it is possible to design
appropriate therapeutic or diagnostic techniques directed to this sequence. Treatment
includes administration of a reagent which modulates activity. The term "modulate"
envisions the suppression of expression of GDF-8 when it is over-expressed, or
augmentation of GDF-8 expression when it is underexpressed. When a muscle-associated
disorder is associated with GDF-8 overexpression, such suppressive reagents
as antisense GDF-8 polynucleotide sequence, dominant negative sequences or GDF-8
binding antibody can be introduced into a cell. In addition, an anti-idiotype antibody
which binds to a monoclonal antibody which binds GDF-8 of the invention, or an epitope
thereof, may also be used in the therapeutic method of the invention. Alternatively, when
a cell proliferative disorder is associated with underexpression or expression of a mutant
GDF-8 polypeptide, a sense polynucleotide sequence (the DNA coding strand) or GDF-8
polypeptide can be introduced into the cell. Such muscle-associated disorders include
cancer, muscular dystrophy, spinal cord injury, traumatic injury, chronic obstructive
pulmonary disease (COPD), AIDS or cachexia. One of skill in the art can determine
whether or not a particular therapeutic course of treatment is successful by several
methods described herein (e.g., muscle fiber analysis or biopsy). Neurodegenerative
disorders are also envisioned as treated by the method of the invention. In addition, the
method of the invention can be used in the treatment of obesity or of disorders related to
abnormal proliferation of adipocytes. One of skill in the art can determine whether or not
a particular therapeutic course of treatment is successful by several methods described
herein (e.g., muscle fiber analysis or biopsy; determination of fat content). The present
examples demonstrate that the methods of the invention are useful for decreasing fat
content, and therefore would be useful in the treatment of obesity and related disorders
(e.g., diabetes). Neurodegenerative disorders are also envisioned as treated by the
method of the invention.

Thus, where a cell-proliferative disorder is associated with the expression of GDF-8,
nucleic acid sequences that interfere with GDF-8 expression at the translational level can
be used. This approach utilizes, for example, antisense nucleic acid and ribozymes to
block translation of a specific GDF-8 mRNA, either by masking that mRNA with an
antisense nucleic acid or by cleaving it with a ribozyme. Such disorders include
neurodegenerative diseases, for example. In addition, dominant-negative GDF-8 mutants
would be useful to actively interfere with function of "normal" GDF-8.

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least
a portion of a specific mRNA molecule (Weintraub, Scientific American, 262:40, 1990).
In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, forming
a double-stranded molecule. The antisense nucleic acids interfere with the translation of
the mRNA, since the cell will not translate an mRNA that is double-stranded.
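
The complementarity described above can be sketched in a few lines of code. This is a minimal illustration only: the example sequence is made up and is not a GDF-8 sequence, the function names are hypothetical, and the window length is a free parameter (15 is used because antisense oligomers of about 15 nucleotides are discussed in this section).

```python
# Watson-Crick complement table for RNA bases.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna_segment: str) -> str:
    """Return the antisense RNA, written 5'->3', for an mRNA segment given 5'->3'."""
    seg = mrna_segment.upper()
    if set(seg) - set("ACGU"):
        raise ValueError("expected an RNA sequence containing only A, C, G, U")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return seg.translate(COMPLEMENT)[::-1]


def antisense_windows(mrna: str, length: int = 15):
    """Yield (start, target, antisense) for every window of `length` bases."""
    for start in range(len(mrna) - length + 1):
        target = mrna[start:start + length]
        yield start, target, antisense_rna(target)


# ASSUMPTION: made-up 20-base sequence used only to exercise the functions.
example_mrna = "AUGGCCAUUGACCUGAAGCU"
oligos = list(antisense_windows(example_mrna, 15))

for start, target, oligo in oligos:
    print(start, target, oligo)
```

Each emitted oligomer is the reverse complement of its target window, so the two strands pair base for base when aligned antiparallel. Choosing among candidate windows (for example by accessibility of the target region) is outside the scope of this sketch.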



Antisense oligomers of about 15 nucleotides are preferred, since they are easily
synthesized and are less likely to cause problems than larger molecules when introduced
into the target GDF-8-producing cell. The use of antisense methods to inhibit the in vitro
translation of genes is well known in the art (Marcus-Sakura, Anal. Biochem., 172:289,
1988).

Ribozymes are RNA molecules possessing the ability to specifically cleave other
single-stranded RNA in a manner analogous to DNA restriction endonucleases. Through
the modification of nucleotide sequences which encode these RNAs, it is possible to
engineer molecules that recognize specific nucleotide sequences in an RNA molecule and
cleave it (Cech, J. Amer. Med. Assn., 260:3030, 1988). A major advantage of this
approach is that, because they are sequence-specific, only mRNAs with particular
sequences are inactivated.

There are two basic types of ribozymes, namely, tetrahymena-type (Hasselhoff, Nature,
334:585, 1988) and "hammerhead"-type. Tetrahymena-type ribozymes recognize
sequences which are four bases in length, while "hammerhead"-type ribozymes recognize
base sequences 11-18 bases in length. The longer the recognition sequence, the greater
the likelihood that the sequence will occur exclusively in the target mRNA species.
Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type
ribozymes for inactivating a specific mRNA species, and 18-base recognition sequences
are preferable to shorter recognition sequences.
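
The relationship between recognition-sequence length and specificity can be made concrete with a back-of-the-envelope estimate. The sketch below assumes, purely for illustration, that RNA bases are independent and equally frequent, and it uses a hypothetical pool size of ten million bases of non-target RNA; neither assumption is made in the text, and real transcript pools are not random.

```python
def expected_chance_sites(recognition_length: int, pool_bases: float) -> float:
    """Expected number of chance matches to one specific recognition sequence.

    Assumes independent, equiprobable bases, so any given position matches a
    specific sequence of length n with probability (1/4)**n. The number of
    start positions is approximated by the pool size.
    """
    if recognition_length < 1:
        raise ValueError("recognition_length must be at least 1")
    return pool_bases / 4 ** recognition_length


# ASSUMPTION: hypothetical amount of non-target RNA sequence, in bases.
POOL_BASES = 1e7

for n in (4, 11, 18):
    print(n, expected_chance_sites(n, POOL_BASES))
```

Under these assumptions a four-base site is expected to recur tens of thousands of times by chance, an 11-base site a few times, and an 18-base site far less than once, which is the reasoning behind preferring longer recognition sequences.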

In another embodiment of the present invention, a nucleotide sequence encoding a GDF-8
dominant negative protein is provided. For example, a genetic construct that contains
such a dominant negative encoding gene may be operably linked to a promoter, such as
a tissue-specific promoter, for example, a skeletal muscle specific promoter (e.g.,
human skeletal muscle α-actin promoter), a developmentally specific promoter (e.g.,
MyHC 3, which is restricted in skeletal muscle to the embryonic period of development),
or an inducible promoter (e.g., the orphan nuclear receptor TIS1).

Such constructs are useful in methods of modulating a subject's skeletal mass. For
example, a method includes transforming an organism, tissue, organ or cell with a genetic
construct encoding a dominant negative GDF-8 protein and suitable promoter in operable
linkage and expressing the dominant negative encoding GDF-8 gene, thereby modulating
muscle mass by interfering with wild-type GDF-8 activity.

GDF-8 most likely forms dimers, homodimers or heterodimers, and may even form
heterodimers with other GDF family members, such as GDF-11 (see Example 4). Hence,
while not wanting to be bound by a particular theory, the dominant negative effect
described herein may involve the formation of non-functional homodimers or
heterodimers of dominant negative and wild-type GDF-8 monomers. More specifically,
it is possible that any non-functional homodimer or any heterodimer formed by the
dimerization of wild-type and/or dominant negative GDF-8 monomers produces a
dominant effect by: 1) being synthesized but not processed or secreted; 2) inhibiting the
secretion of wild-type GDF-8; 3) preventing normal proteolytic cleavage of the
preprotein, thereby producing a nonfunctional GDF-8 molecule; 4) altering the affinity
of the non-functional dimer (e.g., homodimeric or heterodimeric GDF-8) to a receptor
or generating an antagonistic form of GDF-8 that binds a receptor without activating it;
or 5) inhibiting the intracellular processing or secretion of GDF-8 related or TGF-β
family proteins.

Non-functional GDF-8 can function to inhibit the growth regulating actions of GDF-8
on muscle cells that include a dominant negative GDF-8 gene. Deletion or missense
dominant negative forms of GDF-8 that retain the ability to form dimers with wild-type
GDF-8 protein but do not function as wild-type GDF-8 proteins may be used to inhibit
the biological activity of endogenous wild-type GDF-8. For example, in one
embodiment, the proteolytic processing site of GDF-8 may be altered (e.g., deleted),
resulting in a GDF-8 molecule able to undergo subsequent dimerization with endogenous
wild-type GDF-8 but unable to undergo further processing into a mature GDF-8 form.
Alternatively, a non-functional GDF-8 can function as a monomeric species to inhibit
the growth regulating actions of GDF-8 on muscle cells.

Any genetic recombinant method in the art may be used. For example, recombinant
viruses may be engineered to express a dominant negative form of GDF-8 which may be
used to inhibit the activity of wild-type GDF-8. Such viruses may be used
therapeutically for treatment of diseases resulting from aberrant over-expression or
activity of GDF-8 protein, such as in denervation hypertrophy, or as a means of
controlling GDF-8 expression when treating disease conditions involving muscle, such
as in musculodegenerative diseases or in tissue repair due to trauma, or in modulating
GDF-8 expression in animal husbandry (e.g., transgenic animals for agricultural
purposes).

The invention provides a method for treating a muscle or adipose tissue disorder in a
subject. The method includes administering a therapeutically effective amount of a
GDF-8 agent to the subject, thereby inhibiting abnormal growth of muscle or adipose
tissue. The GDF-8 agent may include a GDF-8 antisense molecule or a dominant
negative polypeptide, for example. A "therapeutically effective amount" of a GDF-8
agent is that amount that ameliorates symptoms of the disorder or inhibits GDF-8
induced growth of muscle, for example, as compared with a normal subject.

Gene Therapy 

The present invention also provides gene therapy for the treatment of cell proliferative
or immunologic disorders which are mediated by GDF-8 protein. Such therapy would
achieve its therapeutic effect by introduction of the GDF-8 antisense or dominant
negative encoding polynucleotide into cells having the proliferative disorder. Delivery
of antisense or dominant negative GDF-8 polynucleotide can be achieved using a
recombinant expression vector such as a chimeric virus or a colloidal dispersion system.
Especially preferred for therapeutic delivery of antisense or dominant negative sequences
is the use of targeted liposomes. In contrast, when it is desirable to enhance GDF-8
production, a "sense" GDF-8 polynucleotide or functional equivalent (e.g., the C-terminal
active region) is introduced into the appropriate cell(s).

Various viral vectors which can be utilized for gene therapy as taught herein include
adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus.
Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples
of retroviral vectors in which a single foreign gene can be inserted include, but are not
limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus
(HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV).
A number of additional retroviral vectors can incorporate multiple genes. All of these
vectors can transfer or incorporate a gene for a selectable marker so that transduced cells
can be identified and generated. By inserting a GDF-8 sequence of interest into the viral
vector, along with another gene which encodes the ligand for a receptor on a specific
target cell, for example, the vector is now target specific. Retroviral vectors can be made
target specific by attaching, for example, a sugar, a glycolipid, or a protein. Preferred
targeting is accomplished by using an antibody to target the retroviral vector. Those of
skill in the art will know of, or can readily ascertain without undue experimentation,
specific polynucleotide sequences which can be inserted into the retroviral genome or
attached to a viral envelope to allow target specific delivery of the retroviral vector
containing the GDF-8 antisense polynucleotide.

Since recombinant retroviruses are defective, they require assistance in order to produce
infectious vector particles. This assistance can be provided, for example, by using helper
cell lines that contain plasmids encoding all of the structural genes of the retrovirus under
the control of regulatory sequences within the LTR. These plasmids are missing a
nucleotide sequence which enables the packaging mechanism to recognize an RNA
transcript for encapsulation. Helper cell lines which have deletions of the packaging
signal include, but are not limited to, Ψ2, PA317 and PA12, for example. These cell lines
produce empty virions, since no genome is packaged. If a retroviral vector is introduced
into such cells in which the packaging signal is intact, but the structural genes are
replaced by other genes of interest, the vector can be packaged and vector virion
produced.



Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected with
plasmids encoding the retroviral structural genes gag, pol and env, by conventional
calcium phosphate transfection. These cells are then transfected with the vector plasmid
containing the genes of interest. The resulting cells release the retroviral vector into the
culture medium.

Another targeted delivery system for GDF-8 polynucleotides is a colloidal dispersion
system. Colloidal dispersion systems include macromolecule complexes, nanocapsules,
microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles,
mixed micelles, and liposomes. The preferred colloidal system of this invention is a
liposome. Liposomes are artificial membrane vesicles which are useful as delivery
vehicles in vitro and in vivo. It has been shown that large unilamellar vesicles (LUV),
which range in size from 0.2-4.0 µm, can encapsulate a substantial percentage of an
aqueous buffer containing large macromolecules. RNA, DNA and intact virions can be
encapsulated within the aqueous interior and be delivered to cells in a biologically active
form (Fraley, et al., Trends Biochem. Sci., 6:77, 1981). In addition to mammalian cells,
liposomes have been used for delivery of polynucleotides in plant, yeast and bacterial
cells. In order for a liposome to be an efficient gene transfer vehicle, the following
characteristics should be present: (1) encapsulation of the genes of interest at high
efficiency while not compromising their biological activity; (2) preferential and
substantial binding to a target cell in comparison to non-target cells; (3) delivery of the
aqueous contents of the vesicle to the target cell cytoplasm at high efficiency; and (4)
accurate and effective expression of genetic information (Manning, et al., Biotechniques,
6:682, 1988).

The composition of the liposome is usually a combination of phospholipids, particularly
high-phase-transition-temperature phospholipids, usually in combination with steroids,
especially cholesterol. Other phospholipids or other lipids may also be used. The physical
characteristics of liposomes depend on pH, ionic strength, and the presence of divalent
cations.

Examples of lipids useful in liposome production include phosphatidyl compounds, such
as phosphatidylglycerol, phosphatidylcholine, phosphatidylserine,
phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. Particularly
useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18
carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative
phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and
distearoylphosphatidylcholine.

The targeting of liposomes can be classified based on anatomical and mechanistic
factors. Anatomical classification is based on the level of selectivity, for example,
organ-specific, cell-specific, and organelle-specific. Mechanistic targeting can be
distinguished based upon whether it is passive or active. Passive targeting utilizes the
natural tendency of liposomes to distribute to cells of the reticulo-endothelial system
(RES) in organs which contain sinusoidal capillaries. Active targeting, on the other hand,
involves alteration of the liposome by coupling the liposome to a specific ligand such as
a monoclonal antibody, sugar, glycolipid, or protein, or by changing the composition or
size of the liposome in order to achieve targeting to organs and cell types other than the
naturally occurring sites of localization.

The surface of the targeted delivery system may be modified in a variety of ways. In the
case of a liposomal targeted delivery system, lipid groups can be incorporated into the
lipid bilayer of the liposome in order to maintain the targeting ligand in stable association
with the liposomal bilayer. Various linking groups can be used for joining the lipid
chains to the targeting ligand.

Due to the expression of GDF-8 in muscle and adipose tissue, there are a variety of
applications using the polypeptide, polynucleotide, and antibodies of the invention
related to these tissues. Such applications include treatment of cell proliferative disorders
involving these and other tissues, such as neural tissue. In addition, GDF-8 may be useful
in various gene therapy procedures. In embodiments where GDF-8 polypeptide is
administered to a subject, the dosage range is about 0.1 µg/kg to 100 mg/kg; more
preferably from about 1 µg/kg to 75 mg/kg and most preferably from about 10 mg/kg to
50 mg/kg.

Chromosomal Location of GDF-8

The data in Example 6 show that the human GDF-8 gene is located on chromosome 2.
By comparing the chromosomal location of GDF-8 with the map positions of various
human disorders, it should be possible to determine whether mutations in the GDF-8
gene are involved in the etiology of human diseases. For example, an autosomal
recessive form of juvenile amyotrophic lateral sclerosis has been shown to map to
chromosome 2 (Hentati, et al., Neurology, 42 [Suppl. 3]:201, 1992). More precise
mapping of GDF-8 and analysis of DNA from these patients may indicate that GDF-8
is, in fact, the gene affected in this disease. In addition, GDF-8 is useful for
distinguishing chromosome 2 from other chromosomes.

Transgenic Animals and Methods of Making the Same

Various methods to make the transgenic animals of the subject invention can be
employed. Generally speaking, three such methods may be employed. In one such
method, an embryo at the pronuclear stage (a "one cell embryo") is harvested from a
female and the transgene is microinjected into the embryo, in which case the transgene
will be chromosomally integrated into both the germ cells and somatic cells of the
resulting mature animal. In another such method, embryonic stem cells are isolated and
the transgene incorporated therein by electroporation, plasmid transfection or
microinjection, followed by reintroduction of the stem cells into the embryo where they
colonize and contribute to the germ line. Methods for microinjection of mammalian
species are described in United States Patent No. 4,873,191. In yet another such method,
embryonic cells are infected with a retrovirus containing the transgene whereby the germ
cells of the embryo have the transgene chromosomally integrated therein. When the
animals to be made transgenic are avian, because avian fertilized ova generally go
through cell division for the first twenty hours in the oviduct, microinjection into the
pronucleus of the fertilized egg is problematic due to the inaccessibility of the
pronucleus. Therefore, of the methods to make transgenic animals described generally
above, retrovirus infection is preferred for avian species, for example as described in U.S.
5,162,215. If microinjection is to be used with avian species, however, a recently
published procedure by Love et al. (Biotechnology, 12, Jan 1994) can be utilized
whereby the embryo is obtained from a sacrificed hen approximately two and one-half
hours after the laying of the previous laid egg, the transgene is microinjected into the
cytoplasm of the germinal disc and the embryo is cultured in a host shell until maturity.
When the animals to be made transgenic are bovine or porcine, microinjection can be
hampered by the opacity of the ova, thereby making the nuclei difficult to identify by
traditional differential interference-contrast microscopy. To overcome this problem, the
ova can first be centrifuged to segregate the pronuclei for better visualization.

The "non-human animals" of the invention include bovine, porcine, ovine and avian animals
(e.g., cow, pig, sheep, chicken, turkey). The "transgenic non-human animals" of the
invention are produced by introducing "transgenes" into the germline of the non-human
animal. Embryonal target cells at various developmental stages can be used to introduce
transgenes. Different methods are used depending on the stage of development of the
embryonal target cell. The zygote is the best target for micro-injection. The use of
zygotes as a target for gene transfer has a major advantage in that in most cases the
injected DNA will be incorporated into the host gene before the first cleavage (Brinster
et al., Proc. Natl. Acad. Sci. USA 82:4438-4442, 1985). As a consequence, all cells of the
transgenic non-human animal will carry the incorporated transgene. This will in general
also be reflected in the efficient transmission of the transgene to offspring of the founder
since 50% of the germ cells will harbor the transgene.

The term "transgenic" is used to describe an animal which includes exogenous genetic
material within all of its cells. A "transgenic" animal can be produced by cross-breeding
two chimeric animals which include exogenous genetic material within cells used in
reproduction. Twenty-five percent of the resulting offspring will be transgenic, i.e.,
animals which include the exogenous genetic material within all of their cells in both
alleles. Fifty percent of the resulting animals will include the exogenous genetic material
within one allele and 25% will include no exogenous genetic material.
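
The 25% / 50% / 25% split stated above is the expected outcome of a single-locus cross in which each parent passes on the transgene in half of its gametes. The following is a minimal sketch of that arithmetic, assuming (as a simplification not stated in the text) that every germ cell of each parent derives from a lineage carrying the transgene on one allele; the function name `cross` and the allele symbols `T` and `-` are illustrative only.

```python
from collections import Counter
from itertools import product


def cross(parent_a, parent_b):
    """Return expected genotype fractions for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "T-" for a
    germline carrying the transgene (T) on one allele and nothing (-)
    on the other. Every gamete combination is assumed equally likely.
    """
    counts = Counter(
        "".join(sorted(pair)) for pair in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {genotype: n / total for genotype, n in counts.items()}


# Two parents whose germ cells each carry the transgene on one allele.
offspring = cross("T-", "T-")
print(offspring)
```

Under these assumptions the genotype carrying the transgene on both alleles (`TT`) accounts for one quarter of offspring, the single-allele genotype for one half, and the genotype with no exogenous material for one quarter.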

In the microinjection method useful in the practice of the subject invention, the transgene
is digested and purified free from any vector DNA, e.g., by gel electrophoresis. It is
preferred that the transgene include an operatively associated promoter which interacts
with cellular proteins involved in transcription, ultimately resulting in constitutive
expression. Promoters useful in this regard include those from cytomegalovirus (CMV),
Moloney leukemia virus (MLV), and herpes virus, as well as those from the genes
encoding metallothionein, skeletal actin, P-enolpyruvate carboxylase (PEPCK),
phosphoglycerate kinase (PGK), DHFR, and thymidine kinase. Promoters for viral long
terminal repeats (LTRs) such as Rous Sarcoma Virus can also be employed. When the
animals to be made transgenic are avian, preferred promoters include those for the
chicken β-globin gene, chicken lysozyme gene, and avian leukosis virus. Constructs
useful in plasmid transfection of embryonic stem cells will employ additional regulatory
elements well known in the art such as enhancer elements to stimulate transcription,
splice acceptors, termination and polyadenylation signals, and ribosome binding sites to
permit translation.

Retroviral infection can also be used to introduce a transgene into a non-human animal, as
described above. The developing non-human embryo can be cultured in vitro to the
blastocyst stage. During this time, the blastomeres can be targets for retroviral infection
(Jaenisch, R., Proc. Natl. Acad. Sci. USA 73:1260-1264, 1976). Efficient infection of the
blastomeres is obtained by enzymatic treatment to remove the zona pellucida (Hogan, et
al. (1986) in Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, N.Y.). The viral vector system used to introduce the transgene is
typically a replication-defective retrovirus carrying the transgene (Jahner, et al., Proc.
Natl. Acad. Sci. USA 82:6927-6931, 1985; Van der Putten, et al., Proc. Natl. Acad. Sci.
USA 82:6148-6152, 1985). Transfection is easily and efficiently obtained by culturing
the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra; Stewart,
et al., EMBO J. 6:383-388, 1987). Alternatively, infection can be performed at a later
stage. Virus or virus-producing cells can be injected into the blastocoele (D. Jahner et al.,
Nature 298:623-628, 1982). Most of the founders will be mosaic for the transgene since
incorporation occurs only in a subset of the cells which formed the transgenic non-human
animal. Further, the founder may contain various retroviral insertions of the transgene
at different positions in the genome which generally will segregate in the offspring. In
addition, it is also possible to introduce transgenes into the germ line, albeit with low
efficiency, by intrauterine retroviral infection of the midgestation embryo (D. Jahner et
al., supra).



A third type of target cell for transgene introduction is the embryonal stem cell (ES). ES
cells are obtained from pre-implantation embryos cultured in vitro and fused with
embryos (M. J. Evans et al., Nature 292:154-156, 1981; M. O. Bradley et al., Nature 309:
255-258, 1984; Gossler, et al., Proc. Natl. Acad. Sci. USA 83:9065-9069, 1986; and
Robertson et al., Nature 322:445-448, 1986). Transgenes can be efficiently introduced
into the ES cells by DNA transfection or by retrovirus-mediated transduction. Such
transformed ES cells can thereafter be combined with blastocysts from a non-human
animal. The ES cells thereafter colonize the embryo and contribute to the germ line of
the resulting chimeric animal. (For review see Jaenisch, R., Science 240:1468-1474,
1988).

"Transformed" means a cell into which (or into an ancestor of which) has been
introduced, by means of recombinant nucleic acid techniques, a heterologous nucleic acid
molecule. "Heterologous" refers to a nucleic acid sequence that either originates from
another species or is modified from either its original form or the form primarily
expressed in the cell.

"Transgene" means any piece of DNA which is inserted by artifice into a cell, and
becomes part of the genome of the organism (i.e., either stably integrated or as a stable
extrachromosomal element) which develops from that cell. Such a transgene may
include a gene which is partly or entirely heterologous (i.e., foreign) to the transgenic
organism, or may represent a gene homologous to an endogenous gene of the organism.
Included within this definition is a transgene created by the providing of an RNA
sequence which is transcribed into DNA and then incorporated into the genome. The
transgenes of the invention include DNA sequences which encode GDF-8, and include
GDF-8 sense, antisense, and dominant negative encoding polynucleotides, which may be
expressed in a transgenic non-human animal. The term "transgenic" as used herein
additionally includes any organism whose genome has been altered by in vitro
manipulation of the early embryo or fertilized egg or by any transgenic technology to
induce a specific gene knockout. The term "gene knockout" as used herein refers to the
targeted disruption of a gene in vivo with complete loss of function that has been
achieved by any transgenic technology familiar to those in the art. In one embodiment,
transgenic animals having gene knockouts are those in which the target gene has been
rendered non-functional by an insertion targeted to the gene to be rendered non-functional
by homologous recombination. As used herein, the term "transgenic" includes any
transgenic technology familiar to those in the art which can produce an organism
carrying an introduced transgene or one in which an endogenous gene has been rendered
non-functional or "knocked out." An example of a transgene used to "knock out" GDF-8
function in the present Examples is described in Example 8 and FIGURE 12a. Thus, in
another embodiment, the invention provides a transgene wherein the entire mature
C-terminal region of GDF-8 is deleted.

The transgene to be used in the practice of the subject invention is a DNA sequence
comprising a modified GDF-8 coding sequence. In a preferred embodiment, the GDF-8
gene is disrupted by homologous targeting in embryonic stem cells. For example, the
entire mature C-terminal region of the GDF-8 gene may be deleted as described in the
examples below. Optionally, the GDF-8 disruption or deletion may be accompanied by
insertion of or replacement with other DNA sequences, such as a non-functional GDF-8
sequence. In other embodiments, the transgene comprises DNA antisense to the coding
sequence for GDF-8. In another embodiment, the transgene comprises DNA encoding
an antibody or receptor peptide sequence which is able to bind to GDF-8. The DNA and
peptide sequences of GDF-8 are known in the art, the sequences, localization and activity
being disclosed in WO 94/21681 and pending United States patent application 08/033,923,
filed on March 19, 1993, incorporated by reference in its entirety. The disclosures of both
of these applications are hereby incorporated herein by reference. Where appropriate, DNA
sequences that encode proteins having GDF-8 activity but differ in nucleic acid sequence
due to the degeneracy of the genetic code may also be used herein, as may truncated
forms, allelic variants and interspecies homologues.
The invention also includes animals having heterozygous mutations in GDF-8 or partial
inhibition of GDF-8 function or expression. A heterozygote would exhibit an
intermediate increase in muscle mass as compared to the homozygote, as shown in Table
4 below. In other words, partial loss of function leads to a partial increase in muscle
mass. One of skill in the art would readily be able to determine if a particular mutation
or if an antisense molecule was able to partially inhibit GDF-8. For example, in vitro
testing may be desirable initially by comparison with wild-type or untreated GDF-8 (e.g.,
comparison of northern blots to examine a decrease in expression).

After an embryo has been microinjected, colonized with transfected embryonic stem cells
or infected with a retrovirus containing the transgene (except for practice of the subject
invention in avian species, which is addressed elsewhere herein), the embryo is implanted
into the oviduct of a pseudopregnant female. The consequent progeny are tested for
incorporation of the transgene by Southern blot analysis of blood samples using
transgene-specific probes. PCR is particularly useful in this regard. Positive progeny
(G0) are crossbred to produce offspring (G1) which are analyzed for transgene
expression by Northern blot analysis of tissue samples. To be able to distinguish
expression of like-species transgenes from expression of the animal's endogenous GDF-8
gene(s), a marker gene fragment can be included in the construct in the 3' untranslated
region of the transgene and the Northern probe designed to probe for the marker gene
fragment. The serum levels of GDF-8 can also be measured in the transgenic animal to
establish appropriate expression. Expression of the GDF-8 transgenes, thereby
decreasing the GDF-8 in the tissue and serum levels of the transgenic animals and
consequently increasing the muscle tissue content, results in the foodstuffs from these
animals (i.e., eggs, beef, pork, poultry meat, milk, etc.) having markedly increased muscle
content, preferably without increased, and more preferably with reduced, levels of fat and
cholesterol. By practice of the subject invention, a statistically significant increase in
muscle content, preferably at least a 2% increase in muscle content (e.g., in chickens),
more preferably a 25% increase in muscle content as a percentage of body weight, more
preferably greater than 40% increase in muscle content in these foodstuffs can be
obtained.
Additional Methods of Use

Thus, the present invention includes methods for increasing muscle mass in domesticated
animals, characterized by inactivation or deletion of the gene encoding growth and
differentiation factor-8 (GDF-8). The domesticated animal is preferably selected from
the group consisting of ovine, bovine, porcine, piscine and avian. The animal may be
treated with an isolated polynucleotide sequence encoding growth and differentiation
factor-8, which polynucleotide sequence is also from a domesticated animal selected from
the group consisting of ovine, bovine, porcine, piscine and avian. The present invention
includes methods for increasing the muscle mass in domesticated animals characterized
by administering to a domesticated animal monoclonal antibodies directed to the GDF-8
polypeptide. The antibody may be an anti-GDF-8 antibody, and may be either a monoclonal
antibody or a polyclonal antibody.

The invention includes methods comprising using an anti-GDF-8 monoclonal antibody,
antisense, or dominant negative mutants as a therapeutic agent to inhibit the growth
regulating actions of GDF-8 on muscle cells. Muscle cells are defined to include fetal
or adult muscle cells, as well as progenitor cells which are capable of differentiation into
muscle. The monoclonal antibody may be a humanized (e.g., either fully humanized or
chimeric) monoclonal antibody, of any species origin, such as murine, ovine, bovine,
porcine or avian. Methods of producing antibody molecules with various combinations of
"humanized" antibodies are well known in the art and include combining murine variable
regions with human constant regions (Cabilly, et al., Proc. Natl. Acad. Sci. USA, 81:3273,
1984), or grafting the murine-antibody complementarity determining regions (CDRs)
onto the human framework (Riechmann, et al., Nature 332:323, 1988). Other general
references which teach methods for creating humanized antibodies include Morrison, et
al., Science, 229:1202, 1985; Jones, et al., Nature, 321:522, 1986; Monroe, et al., Nature
312:719, 1985; Oi, et al., BioTechniques, 4:214, 1986; European Patent Application No.
302,620; and U.S. Patent No. 5,024,834. Therefore, by humanizing the monoclonal
antibodies of the invention for in vivo use, an immune response to the antibodies would
be greatly reduced.
The monoclonal antibody, GDF-8 polypeptide, or GDF-8 polynucleotide (all "GDF-8
agents") may have the effect of increasing the development of skeletal muscles. In
preferred embodiments of the claimed methods, the GDF-8 monoclonal antibody,
polypeptide, or polynucleotide is administered to a patient suffering from a disorder
selected from the group consisting of muscle wasting disease, neuromuscular disorder,
muscle atrophy or aging. The GDF-8 agent may also be administered to a patient
suffering from a disorder selected from the group consisting of muscular dystrophy,
spinal cord injury, traumatic injury, congestive obstructive pulmonary disease (COPD),
AIDS or cachexia. In a preferred embodiment, the GDF-8 agent is administered to a
patient with muscle wasting disease or disorder by intravenous, intramuscular or
subcutaneous injection; preferably, a monoclonal antibody is administered within a dose
range between about 0.1 mg/kg to about 100 mg/kg; more preferably between about 1
µg/kg to 75 mg/kg; most preferably from about 10 mg/kg to 50 mg/kg. The antibody
may be administered, for example, by bolus injection or by slow infusion. Slow
infusion over a period of 30 minutes to 2 hours is preferred. The GDF-8 agent may be
formulated in a formulation suitable for administration to a patient. Such formulations
are known in the art.
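
The dose ranges above are stated per kilogram of body weight, so the absolute amount scales linearly with the subject's weight. The sketch below only illustrates that arithmetic; the function name `dose_range_mg` and the 70 kg body weight are assumptions chosen for the example and do not come from the specification.

```python
def dose_range_mg(body_weight_kg, low_mg_per_kg, high_mg_per_kg):
    """Convert a per-kilogram dose range into absolute milligram amounts."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("lower bound exceeds upper bound")
    return (body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg)


# Hypothetical 70 kg subject; the weight is an assumption for illustration.
broad = dose_range_mg(70, 0.1, 100)        # about 0.1 mg/kg to 100 mg/kg
most_preferred = dose_range_mg(70, 10, 50)  # about 10 mg/kg to 50 mg/kg
print(broad, most_preferred)
```

For the assumed 70 kg subject, the most preferred range of about 10 mg/kg to 50 mg/kg corresponds to roughly 700 mg to 3,500 mg of antibody.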



The dosage regimen will be determined by the attending physician considering various
factors which modify the action of the GDF-8 protein, e.g., amount of tissue desired to
be formed, the site of tissue damage, the condition of the damaged tissue, the size of a
wound, type of damaged tissue, the patient's age, sex, and diet, the severity of any
infection, time of administration and other clinical factors. The dosage may vary with
the type of matrix used in the reconstitution and the types of agent, such as anti-GDF-8
antibodies, to be used in the composition. Generally, administration is systemic or
injectable, such as by intravenous (IV), intramuscular (IM) or subcutaneous (Sub-Q)
injection. Administration will generally be initiated at a dose which is minimally
effective, and the dose will be increased over a preselected time course until a positive
effect is observed. Subsequently, incremental increases in dosage will be made, limiting
such incremental increases to such levels that produce a corresponding increase in effect,
while taking into account any adverse effects that may appear. The addition of other
known growth factors, such as IGF I (insulin like growth factor I), human, bovine, or
chicken growth hormone which may aid in increasing muscle mass, to the final
composition, may also affect the dosage. In the embodiment where an anti-GDF-8
antibody is administered, the anti-GDF-8 antibody is generally administered within a
dose range of about 0.1 µg/kg to about 100 mg/kg; more preferably between about
10 mg/kg to 50 mg/kg.

Progress can be monitored by periodic assessment of tissue growth and/or repair, for
example, by x-rays, histomorphometric determinations and tetracycline labeling.

Screening for GDF-8 Modulating Compounds

In another embodiment, the invention provides a method for identifying a compound or
molecule that modulates GDF-8 protein activity or gene expression. The method
includes incubating components comprising the compound and either GDF-8 polypeptide
or a recombinant cell expressing GDF-8 polypeptide, under conditions sufficient to allow
the components to interact, and determining the effect of the compound on GDF-8
activity or expression. The effect of the compound on GDF-8 activity can be measured
by a number of assays, and may include measurements before and after incubating in the
presence of the compound. Compounds that affect GDF-8 activity or gene expression
include peptides, peptidomimetics, polypeptides, chemical compounds and biologic
agents. Assays include Northern blot analysis of GDF-8 mRNA (for gene expression),
Western blot analysis (for protein level) and muscle fiber analysis (for protein activity).

The above screening assays may be used for detecting the compounds or molecules that
bind to the GDF-8 receptor or GDF-8 polypeptide, in isolating molecules that bind to the
GDF-8 gene, for measuring the amount of GDF-8 in a sample, either polypeptide or RNA
(mRNA), for identifying molecules that may act as agonists or antagonists, and the like.
For example, GDF-8 antagonists are useful for treatment of muscular and adipose
tissue disorders (e.g., obesity).
Incubating includes conditions which allow contact between the test compound and
GDF-8 polypeptide or a recombinant cell expressing GDF-8 polypeptide.
Contacting includes in solution and in solid phase, or in a cell. The test compound may
optionally be a combinatorial library for screening a plurality of compounds.
Compounds identified in the method of the invention can be further evaluated, detected,
cloned, sequenced, and the like, either in solution or after binding to a solid support, by
any method usually applied to the detection of a specific DNA sequence such as PCR,
oligomer restriction (Saiki, et al., Bio/Technology, 3:1008-1012, 1985), allele-specific
oligonucleotide (ASO) probe analysis (Conner, et al., Proc. Natl. Acad. Sci. USA,
80:278, 1983), oligonucleotide ligation assays (OLAs) (Landegren, et al., Science,
241:1077, 1988), and the like. Molecular techniques for DNA analysis have been
reviewed (Landegren, et al., Science, 242:229-237, 1988).

All references cited herein are hereby incorporated by reference in their entirety. 

The following examples are intended to illustrate but not limit the invention. While they
are typical of those that might be used, other procedures known to those skilled in the art
may alternatively be used.



EXAMPLE 1

IDENTIFICATION AND ISOLATION OF A NOVEL
TGF-β FAMILY MEMBER

To identify a new member of the TGF-β superfamily, degenerate oligonucleotides were
designed which corresponded to two conserved regions among the known family
members: one region spanning the two tryptophan residues conserved in all family
members except MIS and the other region spanning the invariant cysteine residues near
the C-terminus. These primers were used for polymerase chain reactions on mouse
genomic DNA followed by subcloning the PCR products using restriction sites placed
at the 5' ends of the primers, picking individual E. coli colonies carrying these subcloned
inserts, and using a combination of random sequencing and hybridization analysis to
eliminate known members of the superfamily.

GDF-8 was identified from a mixture of PCR products obtained with the primers

SJL141: 5'-CCGGAATTCGGITGG(G/C/A)A(G/A/T/C)(A/G)A(T/C)TGG(A/G)TI(A/G)TI(T/G)CICC-3' (SEQ ID NO:1)

SJL147: 5'-CCGGAATTC(G/A)CAI(G/C)C(G/A)CA(G/A)CT(G/A/T/C)TCIACI(G/A)(T/C)CAT-3' (SEQ ID NO:2)

PCR using these primers was carried out with 2 µg mouse genomic DNA at 94°C for 1
min, 50°C for 2 min, and 72°C for 2 min for 40 cycles.
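
Each parenthesized group in a degenerate primer stands for a mixture of bases at that position, so the number of distinct oligonucleotides in the pool is the product of the group sizes. The sketch below counts that degeneracy. It assumes the two primers read as transcribed in the string constants (my reading of the printed sequences) and treats inosine (`I`) as a single base rather than as a source of degeneracy; the function name `degeneracy` is illustrative.

```python
import re


def degeneracy(primer):
    """Count the distinct sequences encoded by a primer that uses
    parenthesized, slash-separated alternatives such as (G/C/A)."""
    total = 1
    for group in re.findall(r"\(([^)]*)\)", primer):
        total *= len(group.split("/"))
    return total


# Primer sequences as read from the text above (5' to 3'); assumed transcription.
SJL141 = "CCGGAATTCGGITGG(G/C/A)A(G/A/T/C)(A/G)A(T/C)TGG(A/G)TI(A/G)TI(T/G)CICC"
SJL147 = "CCGGAATTC(G/A)CAI(G/C)C(G/A)CA(G/A)CT(G/A/T/C)TCIACI(G/A)(T/C)CAT"

print(degeneracy(SJL141), degeneracy(SJL147))
```

Placing inosine at the most variable codon positions keeps the pool size in the hundreds; writing out all four bases at each of those positions would multiply it by four per position.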

PCR products of approximately 280 bp were gel-purified, digested with Eco RI,
gel-purified again, and subcloned in the Bluescript vector (Stratagene, San Diego, CA).
Bacterial colonies carrying individual subclones were picked into 96 well microtiter
plates, and multiple replicas were prepared by plating the cells onto nitrocellulose. The
replicate filters were hybridized to probes representing known members of the family,
and DNA was prepared from nonhybridizing colonies for sequence analysis.

The primer combination of SJL141 and SJL147, encoding the amino acid sequences
GW(H/Q/N/K/D/E)(D/N)W(V/I/M)(V/I/M)(A/S)P (SEQ ID NO:9) and
M(V/I/M/T/A)V(D/E)SC(G/A)C (SEQ ID NO:10), respectively, yielded four previously
identified sequences (BMP-4, inhibin βB, GDF-3 and GDF-5) and one novel sequence,
which was designated GDF-8, among 110 subclones analyzed.

Human GDF-8 was isolated using the primers: 

ACM13: 5'-CGCGGATCCAGAGTCAAGGTGACAGACACAC-3' (SEQ ID NO:3); and
ACM14: 5'-CGCGGATCCTCCTCATGAGCACCCACAGCGGTC-3' (SEQ ID NO:4)






PCR using these primers was carried out with one ng human genomic DNA at 94°C for
1 min, 58°C for 2 min, and 72°C for 2 min for 30 cycles. The PCR product was digested
with Bam HI, gel-purified, and subcloned in the Bluescript vector (Stratagene, San
Francisco, CA).

EXAMPLE 2

EXPRESSION PATTERN AND SEQUENCE OF GDF-8

To determine the expression pattern of GDF-8, RNA samples prepared from a variety of
adult tissues were screened by Northern analysis. RNA isolation and Northern analysis
were carried out as described previously (Lee, S.J., Mol. Endocrinol., 4:1034, 1990)
except that hybridization was carried out in 5X SSPE, 10% dextran sulfate, 50%
formamide, 1% SDS, 200 µg/ml salmon DNA, and 0.1% each of bovine serum albumin,
ficoll, and polyvinylpyrrolidone. Five micrograms of twice poly A-selected RNA
prepared from each tissue (except for muscle, for which only 2 ng RNA was used) were
electrophoresed on formaldehyde gels, blotted, and probed with GDF-8. As shown in
FIGURE 1, the GDF-8 probe detected a single mRNA species expressed at highest levels
in muscle and at significantly lower levels in adipose tissue.

To obtain a larger segment of the GDF-8 gene, a mouse genomic library was screened
with a probe derived from the GDF-8 PCR product. The partial sequence of a GDF-8
genomic clone is shown in FIGURE 2a. The sequence contains an open reading frame
corresponding to the predicted C-terminal region of the GDF-8 precursor protein. The
predicted GDF-8 sequence contains two potential proteolytic processing sites, which are
boxed. Cleavage of the precursor at the second of these sites would generate a mature
C-terminal fragment 109 amino acids in length with a predicted molecular weight of
12,400. The partial sequence of human GDF-8 is shown in FIGURE 2b. Assuming no
PCR-induced errors during the isolation of the human clone, the human and mouse amino
acid sequences in this region are 100% identical.
The C-terminal region of GDF-8 following the putative proteolytic processing site shows
significant homology to the known members of the TGF-β superfamily (FIGURE 3).
FIGURE 3 shows the alignment of the C-terminal sequences of GDF-8 with the
corresponding regions of human GDF-1 (Lee, Proc. Natl. Acad. Sci. USA, 88:4250-4254,
1991), human BMP-2 and 4 (Wozney, et al., Science, 242:1528-1534, 1988), human
Vgr-1 (Celeste, et al., Proc. Natl. Acad. Sci. USA, 87:9843-9847, 1990), human OP-1
(Ozkaynak, et al., EMBO J., 9:2085-2093, 1990), human BMP-5 (Celeste, et al., Proc.
Natl. Acad. Sci. USA, 87:9843-9847, 1990), human BMP-3 (Wozney, et al., Science,
242:1528-1534, 1988), human MIS (Cate, et al., Cell, 45:685-698, 1986), human inhibin
α, βA, and βB (Mason, et al., Biochem. Biophys. Res. Commun., 135:957-964, 1986),
human TGF-β1 (Derynck, et al., Nature, 316:701-705, 1985), human TGF-β2 (de Martin,
et al., EMBO J., 6:3673-3677, 1987), and human TGF-β3 (ten Dijke, et al., Proc. Natl.
Acad. Sci. USA, 85:4715-4719, 1988). The conserved cysteine residues are boxed.
Dashes denote gaps introduced in order to maximize the alignment.

GDF-8 contains most of the residues that are highly conserved in other family members,
including the seven cysteine residues with their characteristic spacing. Like the TGF-βs
and inhibin βs, GDF-8 also contains two additional cysteine residues. In the case of
TGF-β2, these two additional cysteine residues are known to form an intramolecular
disulfide bond (Daopin, et al., Science, 257:369, 1992; Schlunegger and Grutter, Nature,
358:430, 1992).

FIGURE 4 shows the amino acid homologies among the different members of the TGF-β
superfamily. Numbers represent percent amino acid identities between each pair
calculated from the first conserved cysteine to the C-terminus. Boxes represent
homologies among highly-related members within particular subgroups. In this region,
GDF-8 is most homologous to Vgr-1 (45% sequence identity).
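
Percent identity between two aligned sequences is the fraction of aligned positions at which both carry the same residue. The sketch below shows one way to compute it. The gap convention (positions with a gap in either sequence are excluded from the denominator) is an assumption, since the text does not say how gaps were counted, and the short sequences are made-up toy strings rather than GDF-8 or Vgr-1 data.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of gap-free aligned positions at which the residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # assumed convention: skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no gap-free positions to compare")
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, for illustration only).
identity = percent_identity("CKRHPLYVDF", "CRRHSLYVDF")
print(identity)
```

Applied pairwise to the region from the first conserved cysteine to the C-terminus of each aligned family member, a calculation of this kind would fill a matrix like the one FIGURE 4 describes.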
EXAMPLE 3

ISOLATION OF cDNA CLONES ENCODING MURINE AND HUMAN GDF-8

In order to isolate full-length cDNA clones encoding murine and human GDF-8, cDNA
libraries were prepared in the lambda ZAP II vector (Stratagene) using RNA prepared
from skeletal muscle. From 5 µg of twice poly A-selected RNA prepared from murine
and human muscle, cDNA libraries consisting of 4.4 million and 1.9 million recombinant
phage, respectively, were constructed according to the instructions provided by
Stratagene. These libraries were screened without amplification. Library screening and
characterization of cDNA inserts were carried out as described previously (Lee, Mol.
Endocrinol., 4:1034-1040).



From 2.4 x 10^6 recombinant phage screened from the murine muscle cDNA library,
greater than 280 positive phage were identified using a murine GDF-8 probe derived
from a genomic clone, as described in Example 1. The entire nucleotide sequence of the
longest cDNA insert analyzed is shown in FIGURE 5a and 5b and SEQ ID NO:11. The
2676 base pair sequence contains a single long open reading frame beginning with a
methionine codon at nucleotide 104 and extending to a TGA stop codon at nucleotide
1232. Upstream of the putative initiating methionine codon is an in-frame stop codon at
nucleotide 23. The predicted pre-pro-GDF-8 protein is 376 amino acids in length. The
sequence contains a core of hydrophobic amino acids at the N-terminus suggestive of a
signal peptide for secretion (FIGURE 6a), one potential N-glycosylation site at
asparagine 72, a putative RXXR proteolytic cleavage site at amino acids 264-267, and
a C-terminal region showing significant homology to the known members of the TGF-β
superfamily. Cleavage of the precursor protein at the putative RXXR site would generate
a mature C-terminal GDF-8 fragment 109 amino acids in length with a predicted
molecular weight of approximately 12,400.

From 1.9 x 10^6 recombinant phage screened from the human muscle cDNA library, 4
positive phage were identified using a human GDF-8 probe derived by polymerase chain
reaction on human genomic DNA. The entire nucleotide sequence of the longest cDNA
insert is shown in FIGURE 5c and 5d and SEQ ID NO:13. The 2743 base pair sequence
contains a single long open reading frame beginning with a methionine codon at
nucleotide 59 and extending to a TGA stop codon at nucleotide 1184. The predicted
pre-pro-GDF-8 protein is 375 amino acids in length. The sequence contains a core of
hydrophobic amino acids at the N-terminus suggestive of a signal peptide for secretion
(FIGURE 6b), one potential N-glycosylation site at asparagine 71, and a putative RXXR
proteolytic cleavage site at amino acids 263-266. FIGURE 7 shows a comparison of the
predicted murine (top) and human (bottom) GDF-8 amino acid sequences. Numbers
indicate amino acid position relative to the N-terminus. Identities between the two
sequences are denoted by a vertical line. Murine and human GDF-8 are approximately
94% identical in the predicted pro-regions and 100% identical following the predicted
RXXR cleavage sites.
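
The reading-frame coordinates given in this example can be checked with simple arithmetic. The sketch below assumes that each reported nucleotide position marks the first base of the codon in question (the ATG and the TGA), which is an interpretation rather than something the text states; the helper names `orf_codons` and `mature_length` are illustrative.

```python
def orf_codons(start_nt, stop_nt):
    """Number of sense codons between the first base of the start codon
    and the first base of the stop codon (stop codon not counted)."""
    length = stop_nt - start_nt
    if length <= 0 or length % 3 != 0:
        raise ValueError("positions do not define an in-frame ORF")
    return length // 3


def mature_length(precursor_len, last_cleavage_site_residue):
    """Residues remaining C-terminal to the last residue of the RXXR site."""
    return precursor_len - last_cleavage_site_residue


murine_len = orf_codons(104, 1232)  # murine cDNA coordinates from the text
human_len = orf_codons(59, 1184)    # human cDNA coordinates from the text

murine_mature = mature_length(murine_len, 267)  # RXXR at residues 264-267
human_mature = mature_length(human_len, 266)    # RXXR at residues 263-266

# The upstream stop codon at nucleotide 23 is in frame with the ATG at 104.
upstream_in_frame = (104 - 23) % 3 == 0

print(murine_len, human_len, murine_mature, human_mature, upstream_in_frame)
```

Under that interpretation the murine precursor comes to 376 residues and the human precursor to 375, and both leave a 109 residue mature fragment, in agreement with the mature region being given as amino acids 268 to 376 in Example 4.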

EXAMPLE 4 
DIMERIZATION OF GDF-8 

To determine whether the processing signals in the GDF-8 sequence are functional and
whether GDF-8 forms dimers like other members of the TGF-β superfamily, the GDF-8
cDNA was stably expressed in CHO cells. The GDF-8 coding sequence was cloned into
the pMSXND expression vector (Lee and Nathans, J. Biol. Chem., 263:3521, 1988) and
transfected into CHO cells. Following G418 selection, the cells were selected in 0.2 µM
methotrexate, and conditioned medium from resistant cells was concentrated and
electrophoresed on SDS gels. Conditioned medium was prepared by Cell Trends, Inc.
(Middletown, MD). For preparation of anti-GDF-8 serum, the C-terminal region of
GDF-8 (amino acids 268 to 376) was expressed in bacteria using the pRSET vector
(Invitrogen, San Diego, CA), purified using a nickel chelate column, and injected into
rabbits. All immunizations were carried out by Spring Valley Labs (Woodbine, MD).
Western analysis using [125I]iodoprotein A was carried out as described (Burnette, W.N.,
Anal. Biochem., 112:195, 1981). Western analysis of conditioned medium prepared from
these cells using an antiserum raised against a bacterially-expressed C-terminal fragment
of GDF-8 detected two protein species with apparent molecular weights of approximately
52K and 15K under reducing conditions, consistent with unprocessed and processed
forms of GDF-8, respectively. No bands were obtained either with preimmune serum or
with conditioned medium from CHO cells transfected with an antisense construct. Under
non-reducing conditions, the GDF-8 antiserum detected two predominant protein species
with apparent molecular weights of approximately 101K and 25K, consistent with
dimeric forms of unprocessed and processed GDF-8, respectively. Hence, like other
TGF-β family members, GDF-8 appears to be secreted and proteolytically processed, and
the C-terminal region appears to be capable of forming a disulfide-linked dimer.

EXAMPLE 5
PREPARATION OF ANTIBODIES AGAINST GDF-8 AND
EXPRESSION OF GDF-8 IN MAMMALIAN CELLS



In order to prepare antibodies against GDF-8, GDF-8 antigen was expressed as a fusion
protein in bacteria. A portion of murine GDF-8 cDNA spanning amino acids 268-376
(mature region) was inserted into the pRSET vector (Invitrogen) such that the GDF-8
coding sequence was placed in frame with the initiating methionine codon present in the
vector; the resulting construct created an open reading frame encoding a fusion protein
with a molecular weight of approximately 16,600. The fusion construct was transformed
into BL21 (DE3) (pLysS) cells, and expression of the fusion protein was induced by
treatment with isopropylthio-β-galactoside as described (Rosenberg, et al., Gene,
56:125-135). The fusion protein was then purified by metal chelate chromatography
according to the instructions provided by Invitrogen. A Coomassie blue-stained gel of
unpurified and purified fusion proteins is shown in FIGURE 8.

The purified fusion protein was used to immunize both rabbits and chickens. Immunization
of rabbits was carried out by Spring Valley Labs (Sykesville, MD), and immunization
of chickens was carried out by HRP, Inc. (Denver, PA). Western analysis of sera
both from immunized rabbits and from immunized chickens demonstrated the presence
of antibodies directed against the fusion protein.

To express GDF-8 in mammalian cells, the murine GDF-8 cDNA sequence from
nucleotides 48-1303 was cloned in both orientations downstream of the metallothionein
I promoter in the pMSXND expression vector; this vector contains processing signals
derived from SV40, a dihydrofolate reductase gene, and a gene conferring resistance to
the antibiotic G418 (Lee and Nathans, J. Biol. Chem., 263:3521-3527). The resulting
constructs were transfected into Chinese hamster ovary cells, and stable transfectants were
selected in the presence of G418. Two milliliters of conditioned media prepared from the
G418-resistant cells were dialyzed, lyophilized, electrophoresed under denaturing,
reducing conditions, transferred to nitrocellulose, and incubated with anti-GDF-8
antibodies (described above) and [125I]iodoprotein A.

As shown in FIGURE 9, the rabbit GDF-8 antibodies (at a 1:500 dilution) detected a 
protein of approximately the predicted molecular weight for the mature C-terminal 
fragment of GDF-8 in the conditioned media of cells transfected with a construct in 
which GDF-8 had been cloned in the correct (sense) orientation with respect to the 
metallothionein promoter (lane 2); this band was not detected in a similar sample
prepared from cells transfected with a control antisense construct (lane 1). Similar results
were obtained using antibodies prepared in chickens. Hence, GDF-8 is secreted and
proteolytically processed by these transfected mammalian cells. 

EXAMPLE 6 

EXPRESSION PATTERN OF GDF-8

To determine the expression pattern of GDF-8, 5 ng of twice poly A-selected RNA prepared from
a variety of murine tissue sources were subjected to Northern analysis. As shown in
FIGURE 10a (and as shown previously in Example 2), the GDF-8 probe detected a single
mRNA species present almost exclusively in skeletal muscle among a large number of
adult tissues surveyed. On longer exposures of the same blot, significantly lower but
detectable levels of GDF-8 mRNA were seen in fat, brain, thymus, heart, and lung.
Hence, these results confirm the high degree of specificity of GDF-8 expression in
skeletal muscle. GDF-8 mRNA was also detected in mouse embryos at both gestational
ages (day 12.5 and day 18.5 post-coital) examined but not in placentas at various stages
of development (FIGURE 10b).

To further analyze the expression pattern of GDF-8, in situ hybridization was performed
on mouse embryos isolated at various stages of development.






For all in situ hybridization experiments, probes corresponding to the C-terminal region
of GDF-8 were excluded in order to avoid possible cross-reactivity with other members
of the superfamily. Whole mount in situ hybridization analysis was carried out as
described (Wilkinson, D.G., In Situ Hybridization, A Practical Approach, pp. 75-83,
IRL Press, Oxford, 1992) except that blocking and antibody incubation steps were
carried out as in Knecht et al. (Knecht, et al., Development, 121:1927, 1995). Alkaline
phosphatase reactions were carried out for 3 hours for day 10.5 embryos and overnight
for day 9.5 embryos. Hybridization was carried out using digoxigenin-labelled probes
spanning nucleotides 8-811 and 1298-2676, which correspond to the pro-region and 3'
untranslated regions, respectively. In situ hybridization to sections was carried out as
described (Wilkinson, et al., Cell, 50:79, 1987) using 35S-labelled probes ranging from
approximately 100-650 bases in length and spanning nucleotides 8-793 and 1566-2595.
Following hybridization and washing, slides were dipped in NTB-3 photographic
emulsion, exposed for 16-19 days, developed and stained with either hematoxylin and
eosin or toluidine blue. RNA isolation, poly A selection, and Northern analysis were
carried out as described previously (McPherron and Lee, J. Biol. Chem., 268:3444,
1993).

At all stages examined, the expression of GDF-8 mRNA appeared to be restricted to
developing skeletal muscle. At early stages, GDF-8 expression was restricted to
developing somites. By whole mount in situ hybridization analysis, GDF-8 mRNA could
first be detected as early as day 9.5 post coitum in approximately one-third of the
somites. At this stage of development, hybridization appeared to be restricted to the most
mature (9 out of 21 in this example), rostral somites. By day 10.5 p.c., GDF-8
expression was clearly evident in almost every somite (28 out of 33 in the example
shown). Based on in situ hybridization analysis of sections prepared from day 10.5 p.c.
embryos, the expression of GDF-8 in somites appeared to be localized to the myotome
compartment. At later stages of development, GDF-8 expression was detected in a wide
range of developing muscles.

GDF-8 continues to be expressed in adult animals as well. By Northern analysis, GDF-8
mRNA expression was seen almost exclusively in skeletal muscle among the different
adult tissues examined. A significantly lower though clearly detectable signal was also
seen in adipose tissue. Based on Northern analysis of RNA prepared from a large
number of different adult skeletal muscles, GDF-8 expression appeared to be widespread
although the expression levels varied among individual muscles.

EXAMPLE 7 
CHROMOSOMAL LOCALIZATION OF GDF-8 

In order to map the chromosomal location of GDF-8, DNA samples from human/rodent
somatic cell hybrids (Drwinga, et al., Genomics, 16:311-314, 1993; Dubois and Naylor,
Genomics, 16:315-319, 1993) were analyzed by polymerase chain reaction followed by
Southern blotting. Polymerase chain reaction was carried out using primer #83,
5'-CGCGGATCCGTGGATCTAAATGAGAACAGTGAGC-3' (SEQ ID NO:15) and primer
#84, 5'-CGCGAATTCTCAGGTAATGATTGTTTCCGTTGTAGCG-3' (SEQ ID NO:16)
for 40 cycles at 94°C for 2 minutes, 60°C for 1 minute, and 72°C for 2 minutes. These
primers correspond to nucleotides 119 to 143 (flanked by a Bam HI recognition
sequence), and nucleotides 394 to 418 (flanked by an Eco RI recognition sequence),
respectively, in the human GDF-8 cDNA sequence. PCR products were electrophoresed
on agarose gels, blotted, and probed with oligonucleotide #100,
5'-ACACTAAATCTTCAAGAATA-3' (SEQ ID NO:17), which corresponds to a
sequence internal to the region flanked by primer #83 and #84. Filters were hybridized
in 6 X SSC, 1 X Denhardt's solution, 100 µg/ml yeast transfer RNA, and 0.05% sodium
pyrophosphate at 50°C.

As shown in FIGURE 11, the human-specific probe detected a band of the predicted size
(approximately 320 base pairs) in the positive control sample (total human genomic
DNA) and in a single DNA sample from the human/rodent hybrid panel. This positive
signal corresponds to human chromosome 2. The human chromosome contained in each
of the hybrid cell lines is identified at the top of each of the first 24 lanes (1-22, X, and
Y). In the lanes designated M, CHO, and H, the starting DNA template was total
genomic DNA from mouse, hamster, and human sources, respectively. In the lane
marked Bl, no template DNA was used. Numbers at left indicate the mobilities of DNA
standards. These data show that the human GDF-8 gene is located on chromosome 2.
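
The predicted product size of approximately 320 base pairs can be reproduced from the primer coordinates given for this example. The Python sketch below assumes that each primer anneals over exactly the 25 nucleotides stated in the text, that any remaining 5' bases are non-templated tails carried into the product, and that the first base of primer #84 (not clearly legible in the source) is C; `predicted_product_size` is an illustrative helper name.

```python
# Primer sequences as given for SEQ ID NO:15 and SEQ ID NO:16.
# Assumption: the leading "C" of primer #84 is a reconstruction, not a confirmed base.
PRIMER_83 = "CGCGGATCCGTGGATCTAAATGAGAACAGTGAGC"
PRIMER_84 = "CGCGAATTCTCAGGTAATGATTGTTTCCGTTGTAGCG"

BAM_HI = "GGATCC"  # Bam HI recognition sequence
ECO_RI = "GAATTC"  # Eco RI recognition sequence


def predicted_product_size(fwd, rev, fwd_start, fwd_end, rev_start, rev_end):
    """Templated span plus the non-annealing 5' tails of both primers."""
    templated = rev_end - fwd_start + 1               # cDNA nucleotides fwd_start..rev_end
    fwd_tail = len(fwd) - (fwd_end - fwd_start + 1)   # 5' bases beyond the annealing region
    rev_tail = len(rev) - (rev_end - rev_start + 1)
    return templated + fwd_tail + rev_tail


size = predicted_product_size(PRIMER_83, PRIMER_84, 119, 143, 394, 418)
print(size)
print(BAM_HI in PRIMER_83, ECO_RI in PRIMER_84)
```

Under these assumptions the templated region is 300 nucleotides and the two tails add 9 and 12 bases, giving 321 base pairs, in line with the band size reported. Both restriction sites named in the text are present in the respective primer tails.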






EXAMPLE 8
GDF-8 TRANSGENIC KNOCKOUT MICE

The GDF-8 gene was disrupted by homologous targeting in
embryonic stem cells. To ensure that the resulting mice would be null for GDF-8
function, the entire mature C-terminal region was deleted and replaced by a neo cassette
(Figure 12a). A murine 129 SV/J genomic library was prepared in lambda FIX II
according to the instructions provided by Stratagene (La Jolla, CA). The structure of the
GDF-8 gene was deduced from restriction mapping and partial sequencing of phage
clones isolated from this library. Vectors for preparing the targeting construct were
kindly provided by Philip Soriano and Kirk Thomas University. R1 ES cells were
transfected with the targeting construct, selected with gancyclovir (2 µM) and G418 (250
µg/ml), and analyzed by Southern analysis. Homologously targeted clones were injected
into C57BL/6 blastocysts and transferred into pseudopregnant females. Germline
transmission of the targeted allele was obtained in a total of 9 male chimeras from 5
independently-derived ES clones. Genomic Southern blots were hybridized at 42°C as
described above and washed in 0.2X SSC, 0.1% SDS at 42°C.

For whole leg analysis, legs of 14 week old mice were skinned, treated with 0.2 M EDTA
in PBS at 4°C for 4 weeks followed by 0.5 M sucrose in PBS at 4°C. For fiber number
and size analysis, samples were directly mounted and frozen in isopentane as described
(Brumback and Leech, Color Atlas of Muscle Histochemistry, pp. 9-33, PSG Publishing
Company, Littleton, MA, 1984). Ten to 30 µm sections were prepared using a cryostat
and stained with hematoxylin and eosin. Muscle fiber numbers were determined from
sections taken from the widest part of the tibialis cranialis muscle. Muscle fiber sizes
were measured from photographs of sections of tibialis cranialis and gastrocnemius
muscles. Fiber type analysis was carried out using the myosin ATPase assay after
pretreatment at pH 4.35 as described (Cumming, et al., Color Atlas of Muscle Pathology,
pp. 184-185, 1994) and by immunohistochemistry using an antibody directed against
type I myosin (MY32, Sigma) and the Vectastain method (Vector Labs); in the
immunohistochemical experiments, no staining was seen when the primary antibodies
were left out. Carcasses were prepared from shaved mice by removing all of the
internal organs and associated fat and connective tissue. Fat content of carcasses from
4 month old males was determined as described (Leshner, et al., Physiol. Behavior,
9:281, 1972).

For protein and DNA analysis, tissue was homogenized in 150 mM NaCl, 100 mM 
EDTA. Protein concentrations were determined using the Biorad protein assay. DNA
was isolated by adding SDS to 1%, treating with 1 mg/ml proteinase K overnight at
55°C, extracting 3 times with phenol and twice with chloroform, and precipitating with
ammonium acetate and EtOH. DNA was digested with 2 mg/ml RNase for 1 hour at
37°C, and following proteinase K digestion and phenol and chloroform extractions, the
DNA was precipitated twice with ammonium acetate and EtOH.

Homologous targeting of the GDF-8 gene was seen in 13/131 gancyclovir/G418
doubly-resistant ES cell clones. Following injection of these targeted clones into
blastocysts, we obtained chimeras from 5 independently-derived ES clones that produced
heterozygous pups when crossed to C57BL/6 females (Figure 12b). Genotypic analysis
of 678 offspring derived from crosses of F1 heterozygotes showed 170 +/+ (25%), 380
+/- (56%), and 128 -/- (19%). Although the ratio of genotypes was close to the expected
ratio of 1:2:1, the smaller than expected number of homozygous mutants appeared to be
statistically significant (p<0.001).
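
The statistical test behind the quoted p value is not named in the text. As a hedged illustration only, the Python sketch below assumes a Pearson chi-square goodness-of-fit test of the three genotype counts against the Mendelian 1:2:1 ratio; the helper name `chi_square_gof` is hypothetical. With three classes there are 2 degrees of freedom, for which the chi-square upper tail probability is exactly exp(-x/2), so no statistics library is needed.

```python
import math


def chi_square_gof(observed, ratio):
    """Pearson chi-square statistic for observed counts against an expected ratio."""
    total = sum(observed)
    expected = [total * r / sum(ratio) for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Genotype counts reported for the 678 offspring: +/+, +/-, -/-
observed = [170, 380, 128]
chi2 = chi_square_gof(observed, [1, 2, 1])

# Upper tail probability for 2 degrees of freedom.
p_value = math.exp(-chi2 / 2)
print(f"chi2 = {chi2:.2f}, p = {p_value:.5f}")
```

Under this assumed test the statistic is about 15.1 and the p value is about 0.0005, which is consistent with the reported p<0.001.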
Homozygous mutants were viable and fertile when crossed to C57BL/6 mice and to each
other. Homozygous mutant animals, however, were approximately 30% larger than their
heterozygous and wild type littermates (Table 1). The difference between mutant and
wild type body weights appeared to be relatively constant irrespective of age and sex in
adult animals. Adult mutants also displayed an abnormal body shape, with pronounced
shoulders and hips. When the skin was removed from animals that had been sacrificed,
it was apparent that the muscles of the mutants were much larger than those of wild type
animals. The increase in skeletal muscle mass appeared to be widespread throughout the
body. Individual muscles isolated from homozygous mutant animals weighed
approximately 2-3 times more than those isolated from wild type littermates (Table 2).
The magnitude of the weight increase appeared to correlate roughly with the
level of GDF-8 expression in the muscles examined. To determine whether the increased
muscle mass could account for the entire difference in total body weights between wild
type and mutant animals or whether many tissues were generally larger in the mutants,
we compared the total body weights to carcass weights. As shown in Table 3, the
difference in carcass weights between wild type and mutant animals was comparable to
the difference in total body weights. Moreover, because the fat content of mutant and
wild type animals was similar, these data are consistent with all of the total body weight
difference resulting from an increase in skeletal muscle mass, although we have not
formally ruled out the possibility that differences in bone mass might also contribute to
the differences in total body mass.



To determine whether the increase in skeletal muscle mass resulted from hyperplasia or
from hypertrophy, histologic analysis of several different muscle groups was performed.
The mutant muscle appeared grossly normal. No excess connective tissue or fat was seen
nor were there any obvious signs of degeneration, such as widely varying fiber sizes (see
below) or centrally-placed nuclei. Quantitation of the number of muscle fibers showed
that at the widest portion of the tibialis cranialis muscle, the total cell number was 86%
higher in mutant animals compared to wild type littermates [mutant = 5470 +/- 121 (n
= 3), wild type = 2936 +/- 288 (n = 3); p < 0.01]. Consistent with this result was the
finding that the amount of DNA extracted from mutant muscle was roughly 50% higher
than from wild type muscle [mutant = 350 µg (n = 4), wild type = 233 µg (n = 3) from
pooled gastrocnemius, plantaris, triceps brachii, tibialis cranialis, and pectoralis muscles;
p = 0.05]. Hence, a large part of the increase in skeletal muscle mass resulted from
muscle cell hyperplasia. However, muscle fiber hypertrophy also appeared to contribute
to the overall increase in muscle mass. As shown in Figure 13, the mean fiber diameter
of the tibialis cranialis muscle and gastrocnemius muscle was 7% and 22% larger,
respectively, in mutant animals compared to wild type littermates, suggesting that the
cross-sectional area of the fibers was increased by approximately 14% and 49%,
respectively. Notably, although the mean fiber diameter was larger in the mutants, the
standard deviation in fiber sizes was similar between mutant and wild type muscle,
consistent with the absence of muscle degeneration in mutant animals. The increase in
fiber size was also consistent with the finding that the protein to DNA ratio (w/w) was
slightly increased in mutant compared to wild type muscle [mutant = 871 +/- 111 (n =
4), wild type = 624 +/- 85 (n = 3); p < 0.05].
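
The percentages quoted in this paragraph follow directly from the reported means. The short Python sketch below recomputes them; it assumes that fiber cross-sections are roughly circular, so that area scales with the square of the diameter, and the function names are illustrative rather than taken from the original work.

```python
def percent_increase(mutant: float, wild_type: float) -> float:
    """Percent by which the mutant value exceeds the wild type value."""
    return 100.0 * (mutant - wild_type) / wild_type


def area_increase_from_diameter(diameter_increase_pct: float) -> float:
    """Percent increase in cross-sectional area for a given percent increase
    in diameter, assuming area is proportional to diameter squared."""
    scale = 1.0 + diameter_increase_pct / 100.0
    return 100.0 * (scale ** 2 - 1.0)


fiber_number = percent_increase(5470, 2936)      # tibialis cranialis fiber counts
dna_yield = percent_increase(350, 233)           # micrograms of DNA from pooled muscles
tibialis_area = area_increase_from_diameter(7)   # 7% larger mean fiber diameter
gastroc_area = area_increase_from_diameter(22)   # 22% larger mean fiber diameter

for label, value in [
    ("fiber number", fiber_number),
    ("DNA yield", dna_yield),
    ("tibialis cranialis fiber area", tibialis_area),
    ("gastrocnemius fiber area", gastroc_area),
]:
    print(f"{label}: +{value:.0f}%")
```

The rounded results (86%, 50%, 14%, and 49%) match the figures stated above, which suggests the area estimates in the text were derived from the diameter measurements in this way.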

Table 4 shows a comparison between muscle weights (in grams) from wild-type (+/+),
heterozygous (+/-) and homozygous knock-out (-/-) mice. The muscle mass is increased
in heterozygous as compared to wild-type animals.

Finally, fiber type analysis of various muscles was carried out to determine whether the
number of both type I (slow) and type II (fast) fibers was increased in the mutant
animals. In most of the muscles examined, including the tibialis cranialis muscle, the
vast majority of muscle fibers were type II in both mutant and wild type animals. Hence,
based on the cell counts discussed above, the absolute number of type II fibers was
increased in the tibialis cranialis muscle. In the soleus muscle, where the number of type
I fibers was sufficiently high that the ratio of fiber types
could be quantitated, the percent of type I fibers was decreased by approximately 33% in
mutant compared to wild type muscle [wild type = 39.2 +/- 8.1 (n = 3), mutant = 26.4 +/-
9.3 (n = 4)]; however, the variability in this ratio for both wild type and mutant animals
was too high to support any firm conclusions regarding the relative number of fiber
types.

EXAMPLE 9
ISOLATION OF RAT AND CHICKEN GDF-8

In order to isolate rat and chicken GDF-8 cDNA clones, skeletal muscle cDNA libraries
prepared from these species were obtained from Stratagene and screened with a murine
GDF-8 probe. Library screening was carried out as described previously (Lee, Mol.
Endocrinol., 4:1034-1040) except that final washes were carried out in 2 X SSC at 65°C.
Partial sequence analysis of hybridizing clones revealed the presence of open reading
frames highly related to murine and human GDF-8. Partial sequences of rat and chicken
GDF-8 are shown in Figures 2c and 2d, respectively, and an alignment of the predicted
rat and chicken GDF-8 amino acid sequences with those of murine and human GDF-8
is shown in Figure 3b. Full length rat and chicken GDF-8 are shown in Figures 14d and
14c, respectively, and sequence alignment between murine, rat, human, baboon, porcine,
ovine, bovine, chicken, and turkey sequences is shown in Figures 15a and 15b. All
sequences contain an RSRR sequence that is likely to represent the proteolytic processing
site. Following this RSRR sequence, the sequences contain a C-terminal region that is
100% conserved among all four species. The absolute conservation of the C-terminal
region between species as evolutionarily far apart as humans and chickens, and baboons
and turkeys, suggests that this region will be highly conserved in many other species as
well.

Similar methodology was used to obtain the nucleotide and amino acid sequences for
baboon (SEQ ID NO:18 and 19, respectively; Figure 14a); bovine (SEQ ID NO:20 and
21, respectively; Figure 14b); turkey (SEQ ID NO:26 and 27, respectively; Figure 14e);
porcine (SEQ ID NO:28 and 29, respectively; Figure 14f); and ovine (SEQ ID NO:30
and 31, respectively; Figure 14g).

Although the invention has been described with reference to the presently preferred
embodiment, it should be understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, the invention is limited only by 
the following claims. 



WO 98/33887    PCT/US98/02479



SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: The Johns Hopkins University School of 
Medicine 

(ii) TITLE OF THE INVENTION: GROWTH DIFFERENTIATION FACTOR-8 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US98/ 

(B) FILING DATE: 05-February-1998

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/795,071 

(B) FILING DATE: 05-February-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/847,910 

(B) FILING DATE: 28-April-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/862,445 

(B) FILING DATE: 23-May-1997 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Lisa A. Haile, Ph.D. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 07265/129W01

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vii) IMMEDIATE SOURCE:
(B) CLONE: SJL141

(ix) FEATURE: 

(A) NAME/KEY: Modified Base 

(B) LOCATION: 1. . .35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
CCGGAATTCG GNTGGVANRA YTGGRTNRTN NKCNCC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE:
(B) CLONE: SJL147 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1. . .33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
CCGGAATTCR CANSCRCARC TNTCNACNRY CAT 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE:
(B) CLONE: ACM13 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . .32 
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGCGGATCCA GAAGTCAAGG TGACAGACAC AC 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vii) IMMEDIATE SOURCE:
(B) CLONE: ACM14



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1. . .33 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CGCGGATCCT CCTCATGAGC ACCCACAGCG GTC 33 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: mouse GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 59... 436 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TTAAGGTAGG AAGGATTTCA GGCTCTATTT ACATAATTGT TCTTTCCTTT TCACACAG 58

AAT CCC TTT TTA GAA GTC AAG GTG ACA GAC ACA CCC AAG AGG TCC CGG 106
Asn Pro Phe Leu Glu Val Lys Val Thr Asp Thr Pro Lys Arg Ser Arg
1 5 10 15

AGA GAC TTT GGG CTT GAC TGC GAT GAG CAC TCC ACG GAA TCC CGG TGC 154
Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser Thr Glu Ser Arg Cys
20 25 30

TGC CGC TAC CCC CTC ACG GTC GAT TTT GAA GCC TTT GGA TGG GAC TGG 202
Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala Phe Gly Trp Asp Trp
35 40 45

ATT ATC GCA CCC AAA AGA TAT AAG GCC AAT TAC TGC TCA GGA GAG TGT 250
Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys
50 55 60

GAA TTT GTG TTT TTA CAA AAA TAT CCG CAT ACT CAT CTT GTG CAC CAA 298
Glu Phe Val Phe Leu Gln Lys Tyr Pro His Thr His Leu Val His Gln
65 70 75 80

GCA AAC CCC AGA GGC TCA GCA GGC CCT TGC TGC ACT CCG ACA AAA ATG 346
Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys Thr Pro Thr Lys Met
85 90 95

TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC AAA GAA CAA ATA ATA TAT 394
Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly Lys Glu Gln Ile Ile Tyr
100 105 110

115 120

ATTAGGTTAG AAACncCCA AGTCATGGAA GGTCTTCCCC TCAATTTCGA AACTGTGAAT 506
TCCTGCAGCC CGGGGGATCC ACTAGTTCTA GAGCGGCCGC CACC 550
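
The feature coordinates and translation rows in this listing can be cross-checked mechanically. The Python sketch below is an illustration only: it assumes the standard genetic code and 1-based, inclusive CDS coordinates, and `translate` and `cds_codons` are hypothetical helper names. The nucleotide row used is the one ending at position 154 in SEQ ID NO: 5.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces ignored) into one-letter amino acid codes."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3   # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cds_codons(first: int, last: int) -> int:
    """Number of codons in a CDS feature with 1-based inclusive coordinates."""
    length = last - first + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


row = "AGA GAC TTT GGG CTT GAC TGC GAT GAG CAC TCC ACG GAA TCC CGG TGC"
print(translate(row))        # one-letter codes for the 16 codons in the row
print(cds_codons(59, 436))   # codons spanned by the CDS feature of SEQ ID NO: 5
```

The row translates to RDFGLDCDEHSTESRC, which matches the three-letter translation printed beneath it (Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser Thr Glu Ser Arg Cys), and the CDS feature spans 126 codons, matching the length given for SEQ ID NO: 6.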



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Asn Pro Phe Leu Glu Val Lys Val Thr Asp Thr Pro Lys Arg Ser Arg
1 5 10 15

Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser Thr Glu Ser Arg Cys
20 25 30

Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala Phe Gly Trp Asp Trp
35 40 45

Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys
50 55 60

Glu Phe Val Phe Leu Gln Lys Tyr Pro His Thr His Leu Val His Gln
65 70 75 80

Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys Thr Pro Thr Lys Met
85 90 95

Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly Lys Glu Gln Ile Ile Tyr
100 105 110

Gly Lys Ile Pro Ala Met Val Val Asp Arg Cys Gly Cys Ser
115 120 125


(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: human GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . .326 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CA AAA AGA TCC AGA AGG GAT TTT GGT CTT GAC TGT GAT GAG CAC TCA 47
Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser
1 5 10 15

ACA GAA TCA CGA TGC TGT CGT TAC CCT CTA ACT GTG GAT TTT GAA GCT 95
Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala
20 25 30

TTT GGA TGG GAT TGG ATT ATC GCT CCT AAA AGA TAT AAG GCC AAT TAC 143
Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn Tyr
35 40 45

TGC TCT GGA GAG TGT GAA TTT GTA TTT TTA CAA AAA TAT CCT CAT ACT 191
Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His Thr
50 55 60

CAT CTG GTA CAC CAA GCA AAC CCC AGA GGT TCA GCA GGC CCT TGC TGT 239
His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys
65 70 75

ACT CCC ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT GGC AAA 287
Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly Lys
80 85 90 95

GAA CAA ATA ATA TAT GGG AAA ATT CCA GCG ATG GTA GTA 326
Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val
100 105



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser Thr
1 5 10 15

Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala Phe
20 25 30

Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys
35 40 45

Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His Thr His
50 55 60

Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys Thr
65 70 75 80

Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly Lys Glu
85 90 95

Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val
100 105



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE:
(B) CLONE: SJL141

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . .9 



... Val, Ile, or Met; Xaa at position ... 6 and 7 ... Ala or Ser.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Gly Trp Xaa Xaa Trp Xaa Xaa Xaa Pro
1 5

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL147 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . .8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Xaa Val Xaa Ser Cys Xaa Cys 
1 5

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2676 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Murine GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...2676
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO ill: 

GTCTCTCGGA CGGTACATGC ACTAATATTT CACTIX^CAT TACTCAAAAG CAAAAAOAAG 60 
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AAATAAGAAC AAGGGAAAAA AAAAGATTGT GCTGATTTTT AAA ATG ATG CAA AAA 115 

Met Met Gin Lys 
1 

CTG CAA ATG TAT GTT TAT ATT TAG CTG TTC ATG CTG ATT GOT GOT GGC 163 
Leu Gin Met Tyr Val Tyr He Tyr Leu Phe Met Leu He Ala Ala Gly 
5 10 15 20 

CCA GTG GAT CTA AAT GAG GGC AGT GAG AGA GAA GAA AAT GTG GAA AAA 211 
Pro Val Asp Leu Asn Glu Gly Ser Glu Arg Glu Glu Asn Val Glu Lys 
25 30 35 



GAG GGG CTG TGT AAT GCA TGT GCG TGG AGA CAA AAC ACG AGG TAC TCC 259
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gln Asn Thr Arg Tyr Ser
40 45 50

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGT AAG CTG CGC CTG GAA 307
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

ACA GCT CCT AAC ATC AGC AAA GAT GCT ATA AGA CAA CTT CTG CCA AGA 355
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Arg
70 75 80

GCG CCT CCA CTC CGG GAA CTG ATC GAT CAG TAC GAC GTC CAG AGG GAT 403
Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

GAC AGC AGT GAT GGC TCT TTG GAA GAT GAC GAT TAT CAC GCT ACC ACG 451
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

GAA ACA ATC ATT ACC ATG CCT ACA GAG TCT GAC TTT CTA ATG CAA GCG 499
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gln Ala
120 125 130

GAT GGC AAG CCC AAA TGT TGC TTT TTT AAA TTT AGC TCT AAA ATA CAG 547
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

TAC AAC AAA GTA GTA AAA GCC CAA CTG TGG ATA TAT CTC AGA CCC GTC 595
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Pro Val
150 155 160

AAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 643
Lys Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG AAA CTT GAC 691
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

ATG AGC CCA GGC ACT GGT ATT TGG CAG AGT ATT GAT GTG AAG ACA GTG 739
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

TTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAC TTA GGC ATT GAA ATC 787
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

AAA GCT TTG GAT GAG AAT GGC CAT GAT CTT GCT GTA ACC TTC CCA GGA 835
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA GAA GTC AAG GTG ACA GAC 883
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp
245 250 255 260

ACA CCC AAG AGG TCC CGG AGA GAC TTT GGG CTT GAC TGC GAT GAG CAC 931
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

TCC ACG GAA TCC CGG TGC TGC CGC TAC CCC CTC ACG GTC GAT TTT GAA 979
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290



GCC TTT GGA TGG GAC TGG ATT ATC GCA CCC AAA AGA TAT AAG GCC AAT 1027 
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

TAC TGC TCA GGA GAG TGT GAA TTT GTG TTT TTA CAA AAA TAT CCG CAT 1075
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

ACT CAT CTT GTG CAC CAA GCA AAC CCC AGA GGC TCA GCA GGC CCT TGC 1123
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

TGC ACT CCG ACA AAA ATG TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC 1171
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGC 1219
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370 

TGT GGG TGC TCA TGAGCTTTGC ATTAGGTTAG AAACTTCCCA AGTCATGGAA GGTCT 1276 
Cys Gly Cys Ser 
375 

TCCCCTCAAT TTCGAAACTG TGAATTCAAG CACCACAGGC TGTAGGCCTT GAGTATGCTC 1336

TAGTAACGTA AGCACAAGCT ACAGTGTATG AACTAAAAGA GAGAATAGAT GCAATGGTTG 1396 

GCATTCAACC ACCAAAATAA ACCATACTAT AGGATGTTGT ATGATTTCCA GAGTTTTTGA 1456 

AATAGATGGA GATCAAATTA CATTTATGTC CATATATGTA TATTACAACT ACAATCTAGG 1516 

CAAGGAAGTG AGAGCACATC TTGTGGTCTG CTGAGTTAGG AGGGTATGAT TAAAAGGTAA 1576 

AGTCTTATTT CCTAACAGTT TCACTTAATA TTTACAGAAG AATCTATATG TAGCCTTTGT 1636 

AAAGTGTAGG ATTGTTATCA TTTAAAAACA TCATGTACAC TTATATTTGT ATTGTATACT 1696 

TGGTAAGATA AAATTCCACA AAGTAGGAAT GGGGCCTCAC ATACACATTG CCATTCCTAT 1756 

TATAATTGGA CAATCCACCA CGGTGCTAAT GCAGTGCTGA ATGGCTCCTA CTGGACCTCT 1816 

CGATAGAACA CTCTACAAAG TACGAGTCTC TCTCTCCCTT CCAGGTGCAT CTCCACACAC 1876 

ACAGCACTAA GTGTTCAATG CATTTTCTTT ATGGAAAGAA GAATCTTTTT TTCTAGAGGT 1936

CAACTTTCAG TCAACTCTAG CACAGCGGGA GTGACTGCTG CATCTTAAAA GGCAGCCAAA 1996 

CAGTATTCAT TTTTTAATCT AAATTTCAAA ATCACTGTCT GCCTTTATCA CATGGCAATT 2056

TTGTGGTAAA ATAATGGAAA TGACTGGTTC TATCAATATT GTATAAAAGA CTCTGAAACA 2116 

ATTACATTTA TATAATATGT ATACAATATT GTTTTGTAAA TAAGTGTCTC CTTTTATATT 2176 

TACTTTGGTA TATTTTTACA CTAATGAAAT TTCAAATCAT TAAAGTACAA AGACATGTCA 2236 

TGTATCACAA AAAAGGTGAC TGCTTCTATT TCAGAGTGAA TTAGCAGATT CAATAGTGGT 2296 

CTTAAAACTC TGTATGTTAA GATTAGAAGG TTATATTACA ATCAATTTAT GTATTTTTTA 2356 

CATTATCAAC TTATGGTTTC ATGGTGGCTG TATCTATGAA TGTGGCTCCC AGTCAAATTT 2416 

CAATGCCCCA CCATTTTAAA AATTACAAGC ATTACTAAAC ATACCAACAT GTATCTAAAG 2476 

AAATACAAAT ATGGTATCTC AATAACAGCT ACTTTTTTAT TTTATAATTT GACAATGAAT 2536

ACATTTCTTT TATTTACTTC AGTTTTATAA ATTGGAACTT TGTTTATCAA ATGTATTGTA 2596 

CTCATAGCTA AATGAAATTA TTTCTTACAT AAAAATGTGT AGAAACTATA AATTAAAGTG 2656 

TTTTCACATT TTTGAAAGGC 2676 

(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 376 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Met Gln Lys Leu Gln Met Tyr Val Tyr Ile Tyr Leu Phe Met Leu
Ile Ala Ala Gly Pro Val Asp Leu Asn Glu Gly Ser Glu Arg Glu Glu
Asn Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gln Asn
Thr Arg Tyr Ser Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys
Leu Arg Leu Glu Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln
Leu Leu Pro Arg Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp
Val Gln Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr
His Ala Thr Thr Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe
Leu Met Gln Ala Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser
Ser Lys Ile Gln Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr
Leu Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg
Leu Ile Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser
Leu Lys Leu Asp Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp
Val Lys Thr Val Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu
Gly Ile Glu Ile Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val
Thr Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val
Lys Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp
Cys Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr
Val Asp Phe Glu Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg
Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln
Lys Tyr Pro His Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser
Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu
Tyr Phe Asn Gly Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met
Val Val Asp Arg Cys Gly Cys Ser
370 



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2743 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Human GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...2743
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

AAGAAAAGTA AAAGGAAGAA ACAAGAACAA GAAAAAAGAT TATATTGATT TTAAAATC 58 

ATG CAA AAA CTG CAA CTC TGT GTT TAT ATT TAC CTG TTT ATG CTG ATT 106 
Met Gln Lys Leu Gln Leu Cys Val Tyr Ile Tyr Leu Phe Met Leu Ile
1 5 10 15

GTT GCT GGT CCA GTG GAT CTA AAT GAG AAC AGT GAG CAA AAA GAA AAT 154
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn
20 25 30

GTG GAA AAA GAG GGG CTG TGT AAT GCA TGT ACT TGG AGA CAA AAC ACT 202
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr
35 40 45

AAA TCT TCA AGA ATA GAA GCC ATT AAG ATA CAA ATC CTC AGT AAA CTT 250
Lys Ser Ser Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu
50 55 60

CGT CTG GAA ACA GCT CCT AAC ATC AGC AAA GAT GTT ATA AGA CAA CTT 298
Arg Leu Glu Thr Ala Pro Asn Ile Ser Lys Asp Val Ile Arg Gln Leu
65 70 75 80

TTA CCC AAA GCT CCT CCA CTC CGG GAA CTG ATT GAT CAG TAT GAT GTC 346
Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val
85 90 95

CAG AGG GAT GAC AGC AGC GAT GGC TCT TTG GAA GAT GAC GAT TAT CAC 394
Gln Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His
100 105 110

GCT ACA ACG GAA ACA ATC ATT ACC ATG CCT ACA GAG TCT GAT TTT CTA 442
Ala Thr Thr Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu
115 120 125

ATG CAA GTG GAT GGA AAA CCC AAA TGT TGC TTC TTT AAA TTT AGC TCT 490
Met Gln Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser
130 135 140

AAA ATA CAA TAC AAT AAA GTA GTA AAG GCC CAA CTA TGG ATA TAT TTG 538
Lys Ile Gln Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu
145 150 155 160

AGA CCC GTC GAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC 586
Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu
165 170 175



ATC AAA CCT ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG 634
Ile Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu
180 185 190

AAA CTT GAC ATG AAC CCA GGC ACT GGT ATT [illegible] GTG 682
Lys Leu Asp Met Asn Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val
195 200 205

AAG ACA [illegible] 730
Lys Thr Val Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly
210 215 220

ATT GAA [illegible] 778
Ile Glu Ile Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr
225 230 235 240

[illegible] 826
Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys
245 250 255

[illegible] 874
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys
260 265 270

[illegible] 922
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val
275 280 285

[illegible] 970
Asp Phe Glu Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr
290 295 300

[illegible] 1018
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys
305 310 315 320

[illegible] 1066
Tyr Pro His Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala
325 330 335

[illegible] 1114
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr
340 345 350

[illegible] 1162
Phe Asn Gly Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val
355 360 365

[illegible] CATAACTXCC TAAAAC 1219
Val Asp Arg Cys Gly Cys Ser
370 375

ATGGAAGGTT TTCCCCTCAA CAATTTTGAA GCTGTGAAAT TAAGTACCAC AGGCTATAGG 1279

CCTAGAGTAT GCTACAGTCA CTTAAGCATA AGCTACAGTA TGTAAACTAA AAGGGGGAAT 1339

ATATGCAATG GTTGGCATTT AACCATCCAA ACAAATCATA CAAGAAAGTT TTATGATTTC 1399

CAGAGTTTTT GAGCTAGAAG GAGATCAAAT TACATTTATG TTCCTATATA TTACAACATC 1459

GGCGAGGAAA TGAAAGCGAT TCTCCTTGAG TTCTGATGAA TTAAAGGAGT ATGCTTTAAA 1519

GTCTATTTCT TTAAAGTTTT GTTTAATATT TACAGAAAAA TCCACATACA GTATTGGTAA 1579

AATGCAGGAT TGTTATATAC CATCATTCGA ATCATCCTTA AACACTTGAA TTTATATTGT 1639

ATGGTAGTAT ACTTGGTAAG ATAAAATTCC ACAAAAATAG GGATGGTGCA GCATATGCAA 1699

TTTCCATTCC TATTATAATT GACACAGTAC ATTAACAATC CATGCCAACG GTGCTAATAC 1759

GATAGGCTGA ATGTCTGAGG CTACCAGGTT TATCACATAA AAAACATTCA GTAAAATAGT 1819

AAGTTTCTCT TTTCTTCAGG TGCATTTTCC TACACCTCCA AATGAGGAAT GGATTTTCTT 1879

TAATGTAAGA AGAATCATTT TTCTAGAGGT TGGCTTTCAA TTCTGTAGCA TACTTGGAGA 1939

AACTGCATTA TCTTAAT^AGG CAGTCAAATG GTGTTTGTTT TTATCAAAAT GTCAAAATAA 1999

CATACTTGGA GAAGTATGTA ATTTTGTCTT XGGAAAATTA CAACACTGCC TTTGCAACAC 2059

TGCAGTTTTT ATGGTAAAAT AATAGAAATG ATCGACTCTA TCAATATTGT ATAAAAAGAC 2119

TGATWICAATG CATTTATATA ATATGTATAC AATATTGTTT TGTAAATAAG TGTCTCCTTT 2179

TTTATTTACT TTGGTATATT TTTACACTAA GGACATTTCA AATTAAGTAC TAAGGCACAA 2239

AGACATGTCA TGCATCACAG AAAAGCAACT ACTTATATTT CAGAGCAAAT TAGCAGATTA 2299

AATAGTGGTC TTAAAACTCC ATATGTTAAT GATTAGATGG TTATATTACA ATCATTTTAT 2359

ATTTTTTTAC ATGATTAACA TTCACTTATG GATTCATGAT GGCTGTATAA AGTGAATTTG 2419

AAATTTCAAT GGTTTACTGT CATTGTGTTT AAATCTCAAC GTTCCATTAT TTTAATACTT 2479

GCAAAAACAT TACTAAGTAT ACCAAAATAA TTGACTCTAT TATCTGAAAT GAAGAATAAA 2539

CTGATGCTAT CTCAACAATA ACTGTTACTT TTATTTTATA ATTTGATAAT GAATATATTT 2599

CTGCATTTAT TTACTTCTGT TTTGTAAATT GGGATTTTGT TAATCAAATT TATTGTACTA 2659

TGACTAAATG AAATTATTTC TTACATCTAA TTTGTAGAAA CAGTATAAGT TATATTAAAG 2719

TGTTTTCACA TTTTTTTGAA AGAC 2743



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met Gln Lys Leu Gln Leu Cys Val Tyr Ile Tyr Leu Phe Met Leu Ile
1 5 10 15

Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn
20 25 30

Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr
35 40 45

Lys Ser Ser Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu
50 55 60

Arg Leu Glu Thr Ala Pro Asn Ile Ser Lys Asp Val Ile Arg Gln Leu
65 70 75 80

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val
85 90 95

Gln Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His
100 105 110

Ala Thr Thr Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu
115 120 125

Met Gln Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser
130 135 140

Lys Ile Gln Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu
145 150 155 160

Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu
165 170 175

Ile Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu
180 185 190

Lys Leu Asp Met Asn Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val
195 200 205

Lys Thr Val Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly
210 215 220

Ile Glu Ile Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr
225 230 235 240

Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys
245 250 255

Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys
260 265 270

Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val
275 280 285

Asp Phe Glu Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr
290 295 300

Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys
305 310 315 320

Tyr Pro His Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala
325 330 335

Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr
340 345 350

Phe Asn Gly Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val
355 360 365

Val Asp Arg Cys Gly Cys Ser
370 375



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: #83

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1...34

(C) OTHER:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CGCGGATCCG TGGATCTAAA TGAGAACAGT GAGC 34

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: #84

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1...37

(C) OTHER:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CGCGAATTCT CAGGTAATGA TTGTTTCCGT TGTAGCG 37

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: #100

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1...20

(C) OTHER:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
ACACTAAATC TTCAAGAATA 20

(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Baboon GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...1055 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG CAA AAA 9 
Met Gln Lys
1

CTG CAA CTC TGT GTT TAT ATT TAC CTG TTT ATG CTG ATT GTT GCT GGT 54
Leu Gln Leu Cys Val Tyr Ile Tyr Leu Phe Met Leu Ile Val Ala Gly
5 10 15 20

CCA GTG GAT CTA AAT GAG AAC AGT GAG CAA AAA GAA AAT GTG GAA AAA 95
Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

GAG GGG CTG TGT AAT GCA TGT ACT TGG AGA CAA AAC ACT AAA TCT TCA 140
Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

AGA ATA GAA GCC ATT AAA ATA CAA ATC CTC AGT AAA CTT CGT CTG GAA 185
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

ACA GCT CCT AAC ATC AGC AAA GAT GCT ATA AGA CAA CTT TTA CCC AAA 230
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Lys
70 75 80

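GCG CCT CCA CTC CGG GAA CTG ATT GAT CAG TAT GAT GTC CAG AGG GAT 275
Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

GAC AGC AGC GAT GGC TCT [illegible] 320
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

[illegible] 365
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gln Val
120 125 130

[illegible] 410
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

TAC AAT AAA GTG GTA AAG [illegible] 455
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Pro Val
150 155 160

[illegible] 500
Glu Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

[illegible] 545
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

[illegible] AAG ACA GTC 590
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

[illegible] 635
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

[illegible] 680
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

[illegible] 725
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp
245 250 255 260

[illegible] 770
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

[illegible] 815
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

[illegible] 860
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

[illegible] 905
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

[illegible] 950
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

[illegible] 995
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGC 1040
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

TGC GGG TGC TCA TGA 1055
Cys Gly Cys Ser
375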



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Baboon GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1...376 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Met Gln Lys
1

Leu Gln Leu Cys Val Tyr Ile Tyr Leu Phe Met Leu Ile Val Ala Gly
5 10 15 20

Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Lys
70 75 80

Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gln Val
120 125 130

Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Pro Val
150 155 160

Glu Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp
245 250 255 260

Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

Cys Gly Cys Ser
375

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS:

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: Bovine GDF-8

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1...1055

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:



ATG CAA AAA 
Met Gln Lys
1

[illegible] 54
Leu Gln Ile Ser Val Tyr Ile Tyr Leu Phe Met Leu Ile Val Ala Gly
5 10 15 20

[illegible] 95
Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

[illegible] 140
Glu Gly Leu Cys Asn Ala Cys Leu Trp Arg Glu Asn Thr Thr Ser Ser
40 45 50

[illegible] 185
Arg Leu Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

[illegible] 230
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Lys
70 75 80

[illegible] 275
Ala Pro Pro Leu Leu Glu Leu Ile Asp Gln Phe Asp Val Gln Arg Asp
85 90 95 100

[illegible] 320
Ala Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Arg Thr
105 110 115

GAA ACG GTC ATT ACC ATG CCC ACG GAG TCT GAT CTT CTA ACG CAA GTG 365
Glu Thr Val Ile Thr Met Pro Thr Glu Ser Asp Leu Leu Thr Gln Val
120 125 130

GAA GGA AAA CCC AAA TGT TGC TTC TTT AAA TTT AGC TCT AAG ATA CAA 410
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

TAC AAT AAA CTA GTA AAG GCC CAA CTG TGG ATA TAT CTG AGG CCT GTC 455
Tyr Asn Lys Leu Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Pro Val
150 155 160

AAG ACT CCT GCG ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 500
Glu Thr Pro Thr Ala Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG AAA CTT GAC 545
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

ATG AAC CCA GGC ACT GGT ATT TGG CAG AGC ATT GAT GTG AAG ACA GTG 590
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

TTG CAG AAC TGG CTC AAA CAA CCT GAA TCC AAC TTA GGC ATT GAA ATC 635
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

AAA GCT TTA GAT GAG AAT GGC CAT GAT CTT GCT GTA ACC TTC CCA GAA 680
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Glu
230 235 240

CCA GGA GAA GAT GGA CTG ACT CCC TTT TTA GAA GTC AAG GTA ACA GAC 725
Pro Gly Glu Asp Gly Leu Thr Pro Phe Leu Glu Val Lys Val Thr Asp
245 250 255 260

ACA CCA AAA AGA TCT AGG AGA GAT TTT GGG CTT GAT TGT GAT GAA CAC 770
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

TCC ACA GAA TCT CGA TGC TGT CGT TAC CCT CTA ACT GTG GAT TTT GAA 815
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

GCT TTT GGA TGG GAT TGG ATT ATT GCA CCT AAA AGA TAT AAG GCC AAT 860
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

TAC TGC TCT GGA GAA TGT GAA TTT GTA TTT TTG CAA AAG TAT CCT CAT 905
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

ACC CAT CTT GTG CAC CAA GCA AAC CCC AGA GGT TCA GCC GGC CCC TGC 950
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

TGT ACT CCT ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT GGC 995
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

GAA GGA CAA ATA ATA TAC GGG AAG ATT CCA GCC ATG GTA GTA GAT CGC 1040
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

TGT GGG TGT TCA TGA 1055
Cys Gly Cys Ser
375

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

Met Gln Lys

Leu Gln Ile Ser Val Tyr Ile Tyr Leu Phe Met Leu Ile Val Ala Gly
Pro Val Asp Leu Asn Glu Asn Ser Glu Gln Lys Glu Asn Val Glu Lys
Glu Gly Leu Cys Asn Ala Cys Leu Trp Arg Glu Asn Thr Thr Ser Ser
Arg Leu Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Lys
Ala Pro Pro Leu Leu Glu Leu Ile Asp Gln Phe Asp Val Gln Arg Asp
Ala Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Arg Thr
105 110 115

Glu Thr Val Ile Thr Met Pro Thr Glu Ser Asp Leu Leu Thr Gln Val
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
Tyr Asn Lys Leu Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Pro Val
Glu Thr Pro Thr Ala Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Glu
Pro Gly Glu Asp Gly Leu Thr Pro Phe Leu Glu Val Lys Val Thr Asp
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

Cys Gly Cys Ser
375



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Chicken GDF-8

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...1055 
(D) OTHER: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 



ATG CAA AAG 9 
Met Gln Lys
1

CTG GCA GTC TAT GTT TAT ATT TAC CTG TTC ATG CAG ATC GCG GTT GAT 54
Leu Ala Val Tyr Val Tyr Ile Tyr Leu Phe Met Gln Ile Ala Val Asp
5 10 15 20

CCG GTG GCT CTG GAT GGC AGT AGT CAG CCC ACA GAG AAC GCT GAA AAA 95
Pro Val Ala Leu Asp Gly Ser Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

GAC GGA CTG TGC AAT GCT TGT ACG TGG AGA CAG AAT ACA AAA TCC TCC 140
Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGC AAA CTG CGC CTG GAA 185
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

CAA GCA CCT AAC ATT AGC AGG GAC GTT ATT AAG CAG CTT TTA CCC AAA 230
Gln Ala Pro Asn Ile Ser Arg Asp Val Ile Lys Gln Leu Leu Pro Lys
70 75 80

GCT CCT CCA CTG CAG GAA CTG ATT GAT CAG TAT GAT GTC CAG AGG GAC 275
Ala Pro Pro Leu Gln Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

GAC AGT AGC GAT GGC TCT TTG GAA GAC GAT GAC TAT CAT GCC ACA ACC 320
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

GAG ACG ATT ATC ACA ATG CCT ACG GAG TCT GAT TTT CTT GTA CAA ATG 365
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gln Met
120 125 130

GAG GGA AAA CCA AAA TGT TGC TTC TTT AAG TTT AGC TCT AAA ATA CAA 410
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

[illegible] CAA GTC 455
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Gln Val
150 155 160

[illegible] 500
Gln Lys Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

[illegible] 545
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

[illegible] 590
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

[illegible] 635
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

[illegible] 680
Lys Ala Phe Asp Glu Thr Gly Arg Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

[illegible] 725
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg Val Thr Asp
245 250 255 260

[illegible] GGC CTT GAC TGT GAT GAG CAC 770
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

[illegible] CCG CTG ACA GTG GAT TTC GAA 815
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

[illegible] 860
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

[illegible] GGA GAA TGC GAA TTT GTG TTT CTA CAG AAA TAC CCG CAC 905
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

[illegible] 950
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

[illegible] 995
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

[illegible] 1040
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

TGC GGG TGC TCA TGA 1055
Cys Gly Cys Ser
375

(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 376 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Chicken GDF-8 
(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1...376
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



Met Gln Lys
1

Leu Ala Val Tyr Val Tyr Ile Tyr Leu Phe Met Gln Ile Ala Val Asp
5 10 15 20

Pro Val Ala Leu Asp Gly Ser Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

Gln Ala Pro Asn Ile Ser Arg Asp Val Ile Lys Gln Leu Leu Pro Lys
70 75 80

Ala Pro Pro Leu Gln Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gln Met
120 125 130

Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Gln Val
150 155 160

Gln Lys Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
165 170 175 180

Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

Lys Ala Phe Asp Glu Thr Gly Arg Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg Val Thr Asp
245 250 255 260

Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

Cys Gly Cys Ser
375



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1276 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Rat GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...1276 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



ATG ATT CAA AAA 115
Met Ile Gln Lys

CCG CAA ATG                         TTT GTG CTG ATT GCT GCT GGC 163
Pro Gln Met Tyr Val Tyr Ile Tyr Leu Phe Val Leu Ile Ala Ala Gly
5 10 15 20

CCA GTG GAT CTA AAT GAG GAC AGT GAG AGA GAG GCG AAT GTG GAA AAA 211
Pro Val Asp Leu Asn Glu Asp Ser Glu Arg Glu Ala Asn Val Glu Lys
25 30 35

GAG GGG CTG TGT AAT GCG TGT GCG TGG AGA CAA AAC ACA AGG TAC TCC 259
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gln Asn Thr Arg Tyr Ser
40 45 50

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGT AAA CTC CGC CTG GAA 307
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

                        AAA GAT GCT ATA AGA CAA CTT CTG CCC AGA 355
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Arg
70 75 80

GCG CCT CCA CTC CGG GAA CTG ATC GAT CAG TAC GAC GTC CAG AGG GAT 403
Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

GAC AGC AGT GAC GGC TCT TTG GAA GAT GAC GAT TAT CAC GCT ACC ACG 451
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

GAA ACA ATC ATT ACC ATG CCT ACC GAG TCT GAC TTT CTA ATG CAA GCG 499
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gln Ala
120 125 130

                                                                547
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

TAC AAC AAA GTG GTA AAG GCC CAG CTG TGG ATA TAT CTG AGA GCC GTC 595
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Ala Val
150 155 160

AAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 643
Lys Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
170 175 180

ATG AAA GAC GGT ACA AGG TAT ACC GGA ATC CGA TCT CTG AAA CTT GAC 691
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
185 190 195

ATG AGC CCA GGC ACT GGT ATT TGG CAG AGT ATT GAT GTG AAG ACA GTG 739
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
200 205 210

TTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAC TTA GGC ATT GAA ATC 787
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
215 220 225

AAA GCT TTG GAT GAG AAT GGG CAT GAT CTT GCT GTA ACC TTC CCA GGA 835
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly
230 235 240

CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA GAA GTC AAA GTA ACA GAC 883
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp
245 250 255 260

ACA CCC AAG AGG TCC CGG AGA GAC TTT GGG CTT GAC TGC GAT GAA CAC 931
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
265 270 275

TCC ACG GAA TCG CGG TGC TGT CGC TAC CCC CTC ACG GTC GAT TTC GAA 979
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
280 285 290

GCC TTT GGA TGG GAC TGG ATT ATT GCA CCC AAA AGA TAT AAG GCT AAT 1027
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
295 300 305

TAC TGC TCT GGA GAG TGT GAA TTT GTG TTC TTA CAA AAA TAT CCG CAT 1075
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
310 315 320

ACT CAT CTT GTG CAC CAA GCA AAC CCC AGA GGC TCG GCA GGC CCT TGC 1123
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
325 330 335 340

TGC ACG CCA ACA AAA ATG TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC 1171
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
345 350 355

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGG 1219
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg
360 365 370

TGT GGG TGC TCG TGA 1276 
Cys Gly Cys Ser 
375 
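
As a cross-check on the paired codon and residue lines of SEQ ID NO:24, the short Python sketch below translates one row copied from the listing. It assumes the standard genetic code, which the listing does not state explicitly; the names CODON_TABLE, ONE_TO_THREE and translate are illustrative only and are not part of the disclosure.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption: the listing does not
# say which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names; "***" marks a stop codon.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}


def translate(codon_row):
    """Translate a space-separated row of codons into three-letter residues."""
    return [ONE_TO_THREE[CODON_TABLE[codon]] for codon in codon_row.split()]


# Row of SEQ ID NO:24 ending at nucleotide 931, copied from the listing.
row = "ACA CCC AAG AGG TCC CGG AGA GAC TTT GGG CTT GAC TGC GAT GAA CAC"
print(" ".join(translate(row)))
```

A row whose printed residues differ from the residue line beneath it in the listing would point to a transcription error in either the codons or the residues.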

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 376 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vii) IMMEDIATE SOURCE:
(B) CLONE: Rat GDF-8

(ix) FEATURE:

(A) NAME/KEY: Protein

(B) LOCATION: 1..376
(D) OTHER:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Ile Gln Lys

Pro Gln Met Tyr Val Tyr Ile Tyr Leu Phe Val Leu Ile Ala Ala Gly
Pro Val Asp Leu Asn Glu Asp Ser Glu Arg Glu Ala Asn Val Glu Lys
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gln Asn Thr Arg Tyr Ser
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
Thr Ala Pro Asn Ile Ser Lys Asp Ala Ile Arg Gln Leu Leu Pro Arg
Ala Pro Pro Leu Arg Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gln Ala
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Ala Val
Lys Thr Pro Thr Thr Val Phe Val Gln Ile Leu Arg Leu Ile Lys Pro
Met Lys Asp Gly Thr Arg Tyr Thr Gly Ile Arg Ser Leu Lys Leu Asp
Met Ser Pro Gly Thr Gly Ile Trp Gln Ser Ile Asp Val Lys Thr Val
Leu Gln Asn Trp Leu Lys Gln Pro Glu Ser Asn Leu Gly Ile Glu Ile
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
Ala Phe Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gln Lys Tyr Pro His
Thr His Leu Val His Gln Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys
Cys Thr Pro Thr Lys Met Ser Pro Ile Asn Met Leu Tyr Phe Asn Gly
Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg

Cys Gly Cys Ser
375 
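
The sequence listing uses three-letter residue names, while the figures use the one-letter code. The Python sketch below converts between the two for the last twenty residues of SEQ ID NO:25; it is illustrative only, and the names THREE_TO_ONE and to_one_letter are hypothetical helpers rather than anything defined in the disclosure.

```python
# Three-letter to one-letter amino acid codes (standard IUPAC abbreviations).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residue_row):
    """Convert a space-separated row of three-letter residues to one-letter code.

    Raises KeyError on any token that is not a standard residue name, which
    makes scanning errors in a transcribed listing easy to spot.
    """
    return "".join(THREE_TO_ONE[name] for name in residue_row.split())


# Residues 357-376 of SEQ ID NO:25, as given in the listing.
tail = (
    "Lys Glu Gln Ile Ile Tyr Gly Lys Ile Pro Ala Met Val Val Asp Arg "
    "Cys Gly Cys Ser"
)
print(to_one_letter(tail))
```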
(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Turkey GDF-8 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1...1055 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ATG CAA AAG 9
Met Gln Lys
1

CTA GCA GTC TAT GTT TAT ATT TAC CTG TTC ATG CAG ATT TTA GTT CAT 54
Leu Ala Val Tyr Val Tyr Ile Tyr Leu Phe Met Gln Ile Leu Val His
5 10 15 20

CCG GTG GCT CTT GAT GGC AGT AGT CAG CCC ACA GAG AAC GCT GAA AAA 95
Pro Val Ala Leu Asp Gly Ser Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

GAC GGA CTG TGC AAT GCT TGC ACG TGG AGA CAG AAT ACT AAA TCC TCC 140
Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGC AAA CTG CGC CTG GAA 185
Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

CAA GCA CCT AAC ATT AGC AGG GAC GTT ATT AAA CAA CTT TTA CCC AAA 230
Gln Ala Pro Asn Ile Ser Arg Asp Val Ile Lys Gln Leu Leu Pro Lys
70 75 80

GCT CCT CCG CTG CAG GAA CTG ATT GAT CAG TAT GAC GTC CAG AGA GAC 275
Ala Pro Pro Leu Gln Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

GAC AGT AGC GAT GGC TCT TTG GAA GAC GAT GAC TAT CAT GCC ACA ACC 320
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

GAA ACG ATT ATC ACA ATG CCT ACG GAG TCT GAT TTT CTT GTA CAA ATG 365
Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gln Met
120 125 130

GAG GGA AAA CCA AAA TGT TGC TTC TTT AAG TTT AGC TCT AAA ATA CAA 410
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys Ile Gln
135 140 145

TAT AAC AAA GTA GTA AAG GCA CAA TTA TGG ATA TAC TTG AGG CAA GTC 455
Tyr Asn Lys Val Val Lys Ala Gln Leu Trp Ile Tyr Leu Arg Gln Val
150 155 160
                                                                500

                                                                545
185

                                                                590

                                                                635

                                                                680
235 240

                                        GAG GTC AGA GTT ACA GAC
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg Val Thr Asp
260

                                    GGC CTT GAC TGC GAC GAG CAC 770
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His
270 275

                                        CTG ACA GTG GAT TTT GAA 815
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu
285 290

                                                                860
        Gly Trp Asp Trp Ile Ile Ala Pro Lys Arg Tyr Lys Ala Asn

                                                                905

                                                                950
330

                                                                995
350

                                                                1040
                                                        Asp Arg
360

TGC GGG TGC TCA TGA 1055
Cys Gly Cys Ser
375

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 376 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vii) IMMEDIATE SOURCE:

(B) CLONE: Turkey GDF-8

(ix) FEATURE:

(A) NAME/KEY: Protein

(B) LOCATION: 1..376
(D) OTHER:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Gln Lys
1

Leu Ala Val Tyr Val Tyr Ile Tyr Leu Phe Met Gln Ile Leu Val His
5 10 15 20

Pro Val Ala Leu Asp Gly Ser Ser Glu Gln Lys Glu Asn Val Glu Lys
25 30 35

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gln Asn Thr Lys Ser Ser
40 45 50

Arg Ile Glu Ala Ile Lys Ile Gln Ile Leu Ser Lys Leu Arg Leu Glu
55 60 65

Gln Ala Pro Asn Ile Ser Arg Asp Val Ile Lys Gln Leu Leu Pro Lys
70 75 80

Ala Pro Pro Leu Gln Glu Leu Ile Asp Gln Tyr Asp Val Gln Arg Asp
85 90 95 100

Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr
105 110 115

Glu Thr Ile Ile Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gln Met
120 125 130
CLAIMS

1. A transgenic non-human animal having a transgene disrupting or interfering
with expression of growth differentiation factor-8 (GDF-8) chromosomally 
integrated into the germ cells of the animal. 

2. The transgenic animal of claim 1, wherein the animal is selected from the
group of species consisting of avian, bovine, ovine, piscine, murine, and 
porcine. 

3. The transgenic animal of claim 1 where the species is avian. 

4. The transgenic animal of claim 1 where the species is bovine. 

5. The transgenic animal of claim 1 where the species is porcine. 

6. The transgenic animal of claim 1 where the species is ovine.

7. The transgenic animal of claim 1 where the species is piscine. 

8. The transgenic animal of claim 1, wherein the transgene comprises GDF-8 
antisense polynucleotide(s). 

9. The transgenic animal of claim 1, wherein the transgene comprises a gene
encoding a dominant negative GDF-8 polypeptide.

10. The transgenic animal of claim 1, wherein the animal is homozygous or
heterozygous for GDF-8 polynucleotide. 

11. A chicken or turkey egg produced by the transgenic animal of claim 3. 
12. Beef obtained from the transgenic animal of claim 4.

13. Milk obtained from the transgenic animal of claim 4. 

14. Pork obtained from the transgenic animal of claim 5.

15. Lamb obtained from the transgenic animal of claim 6. 

16. Chicken or turkey meat produced by the transgenic animal of claim 3.

17. A method of producing animal food products having increased muscle mass 
comprising: 

a) introducing a transgene disrupting or interfering with expression of
growth differentiation factor-8 (GDF-8) into an embryo into germ cells of a
pronuclear embryo of the animal;

b) implanting the embryo into the oviduct of a pseudopregnant female
thereby allowing the embryo to mature to full term progeny;

c) testing the progeny for presence of the transgene to identify 
transgene-positive progeny; 

d) cross-breeding transgene-positive progeny to obtain further
transgene-positive progeny; and 

e) processing the progeny to obtain foodstuff. 

18. The method of claim 17, wherein the transgene comprises GDF-8 antisense
polynucleotides.

19. The method of claim 17, wherein the transgene comprises a gene encoding a
dominant negative GDF-8 polypeptide. 

20. A method of producing avian food products having reduced cholesterol levels 
comprising: 
a) introducing a transgene disrupting or interfering with expression of
growth differentiation factor-8 (GDF-8) into an embryo of an avian animal; 

b) culturing the embryo under conditions whereby progeny are 
hatched; 

c) testing the progeny for presence of the transgene to identify 
transgene-positive progeny; 

d) cross-breeding transgene-positive progeny; and 

e) processing the progeny to obtain foodstuff. 

21. The method of claim 20, wherein the transgene comprises GDF-8 antisense 
polynucleotides. 

22. The method of claim 20, wherein the transgene comprises a gene encoding a 
dominant negative GDF-8 polypeptide. 

23. The transgenic animal of claim 20, wherein the transgene comprises a
polynucleotide encoding a truncated GDF-8 polypeptide.

24. A method for increasing the muscle mass in an animal comprising
administering to the animal an antibody, or fragment thereof, which binds to GDF-8
polypeptide.

25. The method of claim 24, wherein anti-GDF-8 antibody is administered to a 
domesticated animal. 

26. The method of claim 24, wherein the antibody is a monoclonal antibody or a 
polyclonal antibody. 

27. The method of claim 24, wherein the anti-GDF-8 antibody is administered by 
intravenous, intramuscular, multiple bolus, or subcutaneous injections. 

28. The method of claim 27, wherein the anti-GDF-8 antibody is administered 
within a dose range of 0.1 µg/kg to 100 mg/kg.
29. The method of claim 27, wherein the antibody is formulated in a formulation
suitable for administration by injection into an animal.

30. A method of inhibiting the growth regulating actions of GDF-8 comprising
contacting a GDF-8 agent with fetal or adult muscle cells or progenitor cells.

31. The method of claim 30, wherein the agent is selected from the group
consisting of a monoclonal antibody, an antisense nucleic acid and a dominant
negative encoding nucleic acid sequence or polypeptide.

32. The method of claim 31, wherein the antibody is a humanized monoclonal
antibody or a chimeric monoclonal antibody or fragment thereof.

33. The method of claim 30, wherein the agent is administered to a patient
suffering from a disorder selected from the group consisting of muscle wasting
disease, neuromuscular disorder, muscle atrophy and aging.

34. The method of claim 30, wherein the agent is administered to a patient
suffering from a disorder selected from the group consisting of muscular dystrophy,
spinal cord injury, traumatic injury, congestive obstructive pulmonary disease
(COPD), AIDS and cachexia.

35. The method of claim 30, wherein the agent is administered to a patient with
muscle wasting disease or disorder by intravenous, intramuscular or
subcutaneous injection.

36. The method of claim 31, wherein the monoclonal antibody is administered
within a dose range between about 0.1/kg to about 100 mg/kg.

37. The method of claim 31, wherein the monoclonal antibody is formulated in a
formulation suitable for administration to a patient.
38. A method for treating a muscle or adipose tissue disorder in a subject,
comprising administering a therapeutically effective amount of a GDF-8 agent to
the subject, thereby inhibiting abnormal growth of muscle or adipose tissue. 

39. The method as in claim 38, wherein the GDF-8 agent is selected from the 
group consisting of an antisense polynucleotide, a polynucleotide encoding a 
dominant negative GDF-8 polypeptide, a GDF-8 antibody and a polynucleo- 
tide encoding a truncated GDF-8 polypeptide. 

40. The method of claim 38, wherein the disorder is a cancer selected from the 
group consisting of muscle, connective tissue, or bone. 

41. The method of claim 38, wherein the subject has an obesity disorder.

42. A method of inhibiting the growth regulating actions of GDF-8 in a subject
comprising administering to the subject, a GDF-8 agent that inhibits the action 
of GDF-8 in the subject. 

43. The method of claim 42, wherein the GDF-8 agent is selected from the group
consisting of an antisense polynucleotide, a polynucleotide encoding a domi- 
nant negative GDF-8 polypeptide, a GDF-8 antibody and a polynucleotide 
encoding a truncated GDF-8 polypeptide. 
44. A method for identifying a compound that affects GDF-8 activity or gene
expression comprising: 

a) incubating the compound with GDF-8 polypeptide, or with a 
recombinant cell expressing GDF-8 under conditions sufficient to allow the compo- 
nents to interact; and 

b) determining the effect of the compound on GDF-8 activity or
expression.



45. The method of claim 44, wherein the effect is inhibition of GDF-8 activity or 
expression. 

46. The method of claim 44, wherein the effect is stimulation of GDF-8 activity or 
expression. 

47. An isolated polynucleotide encoding a truncated GDF-8 polypeptide wherein 
the truncation is a loss of the C-terminal active fragment of GDF-8. 



48. The isolated polynucleotide of claim 47, wherein the polynucleotide is as
shown in FIGURE 12a. 
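
Claim 28 recites a dose range expressed per kilogram of body weight. The Python sketch below works through the arithmetic for a single animal. It is an illustration only: it assumes the lower bound is 0.1 micrograms per kilogram, the 500 kg body weight is a hypothetical example, and the function name dose_bounds_mg does not come from the claims.

```python
def dose_bounds_mg(body_weight_kg, low_ug_per_kg=0.1, high_mg_per_kg=100.0):
    """Return (lowest, highest) total dose in milligrams for one animal.

    The defaults mirror the range recited in claim 28, reading the lower
    bound as micrograms per kilogram (an assumption).
    """
    low_mg = low_ug_per_kg * body_weight_kg / 1000.0  # micrograms -> milligrams
    high_mg = high_mg_per_kg * body_weight_kg
    return low_mg, high_mg


# Hypothetical 500 kg animal.
low, high = dose_bounds_mg(500.0)
print(f"{low:.3f} mg to {high:.0f} mg")
```

The two bounds differ by a factor of one million, so any practical dosing scheme would need to narrow the range considerably.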
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rTAAGGTAGGAAGGATTTCAGGCTCTATTTACATAATTCTTCTTTCCTTTrCACACAGAA 



60 

N 



6i raTTTTAGAAGTCAAGGTGACAGACACACCCAAGAGGTCCCGGAGAGACTTTGGGCI 120 

121 WGCGATGAGCACTCCACGGAATCCCGGTGCTGCCGCTAcWTCACGCm 180 
181 irJrJr.LtlJ.J.JJ^ ^ C R Y P L T V D F 



TGAAGCCTTTGGATGGGACTGGATTATCGCACCCAAAAGATATAAGGCCAATTACTGCTC 240 
'''^'^CWOWIlAPKRYKANYr'^ 
AGGAGAGTGTGAATTTGTGTTTTTACAAAAATATCCGCATACTCATCT^ 500 
Lrrr ^ ^ Q K Y P H T H L V H Q A 



301 AMCCCCAGAGGCTCAGCAGGCCCTTGCTGCACTCCGACAAAAATGTCTC^^^ 360 

NPRGSAGPCCTPTKMSPINM 
361 GCTATATTTTMTGGCAAAGAACAAATAATATATGGGAAAA 

LiFNGKEOliYCKIPAMVvn 
421 CCGCTGTGGGTGCTCAT^^^^ 

S^^™"^^'^ 540 

FIG.2a 



^ ^^J^^^F^^ATpTaTCTTGACTGTGATGAGCACTCAACAGAATCAO^^ 60 



liLRj S (O D F G L D C D E H S T E S R r 
61 GCTGTCGTTACCCTCTAACTGTGGATTTTGAAGCTTTTGGATGGGATTGGATTATOGCTC 120 

LKYPLTVDFEAFGWOWIIAP 
121 CTAAAAGATATAAGGCCAATTACTGCTCTGGAGAGTGTGAATTTGTATrTT ,60 

KHYKANYCSGECEFVFLOKY 
181 ATCCTCATACTCATCTGGTACACCAAGCAAACCCCAGAGGTTCAGCAGGCCCTrGCTGTA 240 

PHTHLVHQANPRGSAGPrri 
241 CTCCCACAAAGmCTCCAATTAATAm^ ^ 

f'IKMSPlNMLYFNGKFOI iv 
301 ATGGGAAAATTCCAGaATGGTAGTA 326 ^ »^ t U I I Y 

G K I P A M V V 

FIG.2b 



WO 98/33887

4/32

PCT/US98/02479



GAA GAT GGG CTG AAT CCC TTT TTA GAA GTC AAA GTA ACA GAC ACA CCC AAG AGG TCC CGG
E D G L N P F L E V K V T D T P K R S R

AGA GAC TTT GGG CTT GAC TGT GAT GAA CAC TCC ACG GAA TCG CGG TGC TGT CGC TAC CCC
R D F G L D C D E H S T E S R C C R Y P

CTC ACG GTC GAT TTC GAA GCC TTT GGA TGG GAC TGG ATT ATT GCA CCC AAA AGA TAT AAG
L T V D F E A F G W D W I I A P K R Y K

GCT AAT TAC TGC TCT GGA GAG TGT GAA TTT GTG TTC TTA CAA AAA TAT CCG CAT ACT CAT
A N Y C S G E C E F V F L Q K Y P H T H

CTT GTG CAC CAA GCA AAC CCC AGA GGC TCG GCA GGC CCT TGC TGC ACG CCA ACA AAA ATG
L V H Q A N P R G S A G P C C T P T K M

TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA
S P I N M L Y F N G K E Q I I Y G K I P

GCC ATG GTA GTA GAC CGG TGT GGG TGC TCG TGA GCT TTG CAT TAG CTT TAA AAT TTC CCX
A M V V D R C G C S

AAT CGT GGA AGG TCT TCC CCT CGA TTT CGA AAC TGT GAA TTT ATG TAC CAC AGG CTG TAG

Rat GDF-8

FIG. 2c
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T7A CTA oTA AAG GCA CAA TTA TGG ATA TAC TTr arr , ^-.^ 
^ V V K A Q c W r Y ^ o ^"^"^ ""'^^ ^ ^^'^ ACG GTG 
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«C CAA A« AAA AAC CTA TAT CCC CK A,,: CAT CT TCA AAC TO OAA ATT «:= TAC OCT 



AGG CAT TGC C 



Chicken GDF-8 



FIG. 2d 
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.21 ^crArcTmrATTrAcaGTraiGCTCAnccrGci™^^^^ ,8o 

181 COT«;rCAG«MCAAMTGTGCAAAAK™ 

301 \"~7™CM.2.,|crJr^^^^^^^ 

361 mCICmAACTGAmiaGTA(mTCCW^ 

P I- R E I. I 0 Q r 0 V Q R 0 0 s <; n r V 
421 TTTffiMCATCACGATIATCACCCIA(XM^ 

481 TCAcmciMTcmracA^cccUfcTmnr^i^^^^^ 54o 
54, ^^mwm>mAm^ ^ 

601 IpCCTACAACACIGTITCrCCAAATTO^^ ^ 

66, CIAIACTOAAICaATCICIGAAAaTICAaW^ 

'''^'"SLKLOMSPGTGlwnt; 
72, TArTGATGIGAACACAGrCTICMAAAirCCCICAAAa^ ,80 

I D V K T V L Q N W I K Q P F s N I r r 
781 KMATCAAAaTTTGCAIGAMTOCATGATCTTGCrGTW 

'^''^'^'■DENGHDLAVTFPrPr 
84, ««GATWCI6JATOITTIMA«^^ 

tUGLNPFLEVKVTOTP k Ip c p " 
90, WTTraiTMmATG«CACrTO^ 35„ 

JiJiJFGLDCDEHSTFSRrrRVD 
96, C^TCmTOriirGAACCCUTGCATCCCACrGGATTA^ ,02„ 

•-iVDFEAFGWDtfllAPKPYi^ 
102, GG(mTTACrCCMGTGAATTI^^^^^ „ 

"08, TCIIOrCCACCAACC^^ 

1.4, mTTAAUTGCIAm^^ 

.20, ™AGmTGTmTCA^I«^ 
1261 AAGTCATGGAAGGTCTTCCCCTCAATTTCGAAACTGTGAATTCAAGCACCACAGGCTGTA 1320

1321 GGCCTTGAGTATGCTCTAGTAACGTAAGCACAAGCTACAGTGTATGAACTAAAAGAGAGA 1380 

1381 ATAGATGCAATGGTTGGCATTCAACCACCAAAATAAACCATACTATAGGATGTTGTATGA 1440 

1441 TTTCCAGAGTTTTTGAAATAGATGGAGATCAAATTACATTTATGTCCATATATGTATATT 1500

1501 ACAACTACAATCTAGGCAAGGAAGTGAGAGCACATCTTGTGGTCTGCTGAGTTAGGAGGG 1560 

1561 TATGATTAAAAGGTAAAGTCTTATTTCCTAACAGTTTCACTTAATATTTACAGAAGAATC 1620 

1621 TATATGTAGCCTTTGTAAAGTGTAGGATTGTTATCATTTAAAAACATCATGTACACTTAT 1680 

1681 ATTTGTATTGTATACTTGGTAAGATAAAATTCCACAAAGTAGGAATGGGGCCTCACATAC 1740 

1741 ACATTGCCATTCCTATTATAATTGGACAATCCACCACGGTGCTAATGCAGTGCTGAATGG 1800 

1801 CTCCTACTGGACCTCTCGATAGAACACTCTACAAAGTACGAGTCTCTCTCTCCCTTCCAG 1860 

1861 GTGCATCTCCACACACACAGCACTAAGTGTTCAATGCATTTTCTTTAAGGAAAGAAGAAT 1920 

1921 CTTTTTTTCTAGAGGTCAACTTTCAGTCAACTCTAGCACAGCGGGAGTGACTGCTGCATC 1980 

1981 TTAAAAGGCAGCCAAACAGTATTCATTTTTTAATCTAAATTTCAAAATCACTGTCTGCCT 2040 

2041 TTATCACATGGCAATTTTGTGGTAAAATAATGGAAATGACTGGTTCTATCAATATTGTAT 2100 

2101 AAAAGACTCTGAAACAATTACATTTATATAATATGTATACAATATTGTTTTGTAAATAAG 2160 

2161 TGTCTCCTTTTATATTTACTTTGGTATATTTTTACACTAATGAAATTTCAAATCATTAAA 2220 

2221 GTACAAAGACATGTCATGTATCACAAAAAAGGTGACTGCTTCTATTTCAGAGTGAATTAG 2280 

2281 CAGATTCAATAGTGGTCTTAAAACTCTGTATGTTAAGATTAGAAGGTTATATTACAATCA 2340 

2341 ATTTATGTATTTTTTACATTATCAACTTATGGTTTCATGGTGGCTGTATCTATGAATGTG 2400 

2401 GCTCCCAGTCAAATTTCAATGCCCCACCATTTTAAAAATTACAAGCATTACTAAACATAC 2460

2461 CAACATGTATCTAAAGAAATACAAATATGGTATCTCAATAACAGCTACTTTTTTATTTTA 2520 

2521 TAATTTGACAATGAATACATTTCTTTTATTTACTTCAGTTTTATAAATTGGAACTTTGTT 2580 

2581 TATCAAATGTATTGTACTCATAGCTAAATGAAATTATTTCTTACATAAAAATGTGTAGAA 2640 

2641 ACTATAAATTAAAGTGTTTTCACATTTTTGAAAGGC 2676 
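
Each figure row gives a start coordinate, a run of bases and an end coordinate, so the row labels can be checked against the number of bases printed. The Python sketch below illustrates such a check on the last two rows above; the function name row_is_consistent is a hypothetical helper, and the check assumes that a row carries no embedded spaces or missing characters.

```python
def row_is_consistent(line):
    """Return True if a 'START BASES END' row spans END - START + 1 bases."""
    start, bases, end = line.split()
    return int(end) - int(start) + 1 == len(bases)


rows = [
    "2581 TATCAAATGTATTGTACTCATAGCTAAATGAAATTATTTCTTACATAAAAATGTGTAGAA 2640",
    "2641 ACTATAAATTAAAGTGTTTTCACATTTTTGAAAGGC 2676",
]
for row in rows:
    # Print the start coordinate and whether the row length matches its labels.
    print(row.split()[0], row_is_consistent(row))
```

A row that fails the check has either a misprinted coordinate or a base lost or gained in transcription.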
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12 
18 
24 
30 
36 
42 
48 



AAGAAAAGTAAAAGGAAGAAACAAGAACAAGAAAAAAGATTATATTGATTTTAAAATCAT 60 

CCAAAAACTGCAACTCTGTGTTTATATTTACCTGTTTATGCTGATrGTTGCTGGTCCAGT 120 
Q X L 0 L C V Y I Y L F M L I V A G P V 
GGATCTAAATGAGAACAGTGAGCAAAAAGAAAATGTGGAAAAAGAGGGGCTGTGTAATGC 180 
0 L N E N S E Q K E N V E K E G L C N A 
ATGTACTTGGAGACAAAACACTAAATCTTCAAGAATAGAAGCCATTAAGATACAAATCCT 240 
C T W R Q N T K S S R 1 E A I K i 0 I L 
CAGTAAACTTCGTCTGGAAACACCTC CTAACATCAG CAAAGArGTTATAAGACAArTTTT m 
S K L R L E T A P [EEll K 0 V I R Q L L 
ACCCAAAGCTCCTCCACTCCGGGAACTGATTGATCAGTATGATGTCCAGAGGGATGACAG 360 
P K A P P L R E L 1 D Q Y D V Q R 0 D S 
CAGCGATGGCTCTTTGGAAGATGACGATTATCACGCTACAACGGAAACAATCATTACCAT 420 
SOGSLEDDOYHAITETIITM 
GCCTACAGAGTCTGATTTTCTAATGCAAGTGGATGGAAAACCCAAATGTTGCTTCTTTAA 480 
PTESOFLMOVOGKPKCCFFK 
ATTTAGCTCTAAAATACAATACAATAAAGTAGTAAAGGCCCAACTATGGATATATTTGAG 540 
FSSKIQYNKVVKAQLWIYLR 
541 ACCCGTCGAGACTCCTACAACAGTGTTTGTGCAAATCCTGAGACTCATCAAACCTATGAA 600 

PVETPTTVFVQILRLIKPMK 
601 AGACGGTACAAGGTATACTGGAATCCGATCTCTGAAACTTGACATGAACCCAGGCACTGG 660 

0 G T R Y T G I R S L K L D M N P G T G 
661 TATTTGGCAGAGCATTGATGTGAAGACAGTGTTGCAAAATTGGCTCAAACAACCTGAATC 720 

iWQSIDVKTVLQNWLKQPES 
721 CAACTTAGGCATTGAAATAAAAGCTTTAGATGAGAATGGTCATGATCTTGCTGTAACCTT 780 

NLGIEIKALDENGHDLAVTF 
781 CCCAGGACCAGGAGAAGATGGGCTGAATCCGTTTTTAGAGGTCAAGGTAACAGACACACC 840 

PGPGEDGLNPFLEVKVTOTP 
841 AAAM?ATCCAGAAGGGATTTTGGTnTTnAPTnTnATr.flrrArTrAArA^/^^T^/^r(^;yTC 900 
K IR S R RI D F G L 0 C D E H S T E S R C 
CTGTCGTTACCCTCTAACTGTGGATTTTGAAGCTTTTGGATGGGATTGGATTATCGCTCC 960 
CRYPLTVDFEAFGWDWI lAP 

TAAAAGATATAAGGCCAATTACTGCTCTGGAGAGTGTGAATTTGTATTTTTACAAAAATA 1020 
KRYKANYCSGECEFVFLQKY 
TCCTCATACTCATCTGGTACACCAAGCAAACCCCAGAGGTTCAGCAGGCCCTTGCTGTAC 1080 
PHTHLVHQANPRGSAGPCCT 
TCCCACAAAGATGTCTCCAATTAATATGCTATATTTTAATGGCAAAGAACAAATAATATA 1 140 
P T K M S P I N M L Y F N G K E Q I T Y 

TGGGAAAATTCCAGCGATGGTAGTAGACCGCTGTGGGTGCTCATGAGATTTATATTAAGC 1 200 
GKIPAMVVORCGCS* 

FIG.Sc 



90 
96 
102 
108 
114 



W09803887 



12/32 



PCTAJS98/02479 



1201 GTTCATAACTTCCTAAAACATGGAAGGTTTTCCCCTCAACAATTTTGAAGCTGTGAAATT 1260 

1261 AAGTACCACAGGCTATAGGCCTAGAGTATGCTACAGTCACTTAAGCATAAGCTACAGTAT 1320 

1321 GTAAACTAAAAGGGGGAATATATGCAATGGTTGGCATTTAACCATCCAAACAAATCATAC 1380 

1381 AAGAAACTTTTATGATTTCCAGACTTTTTGAGCTAGAAGGAGATCAAATTACATTTATGT 1440 

1441 TCCTATATATTACAACATCGGCGAGGAAATGAAAGCGATTCTCCTTGAGTTCTGATGAAT 1500 

1501 TAAAGGAGTATGCTTTAAAGTCTATTTCTTTAAAGTTTTGTTTAATATTTACAGAAAAAT 1560 

1561 CCACATACAGTATTGGTAAAATGCAGGATTGTTATATACCATCATTCGAATCATCCTTAA 1620 

1621 ACACTTGAATTTATATTGTATGGTAGTATACTTGGTAAGATAAAATTCCACAAAAATAGG 1680 

1681 GATGGTGCAGCATATGCAATTTCCATTCCTATTATAATTGACACAGTACATTAACAATCC 1740 

1741 ATGCCAACGGTGCTAATACGATAGGCTGAATGTCTGAGGCTACCAGGTTTATCACATAAA 1800 

1801 AAACATTCACTAAAATAGTAAGTTTCTCTTTTCTTCAGGTGCATTTTCCTACACCTCCAA 1860 

1861 ATGAGGAATGGATTTTCTTTAATGTAAGAAGAATCATTTTTCTAGAGGTTGGCTTTCAAT 1920

1921 TCTGTAGCATACTTGGAGAAACTGCATTATCTTAAAAGGCAGTCAAATGGTGTTTGTTTT 1980 

1981 TATCAAAATGTCAAAATAACATACTTGGAGAAGTATGTAATTTTGTCTTTGGAAAATTAC 2040 

2041 AACACTGCCTTTGCAACACTGCAGTTTTTATGGTAAAATAATAGAAATGATCGACTCTAT 2100 

2101 CAATATTGTATAAAAAGACTGAAACAATGCATTTATATAATATGTATACAATATTGTTTT 2160 

2161 GTAAATAAGTGTCTCCTTTTTTATTTACTTTGGTATATTTTTACACTAAGGACATTTCAA 2220

2221 ATTAAGTACTAAGGCACAAAGACATGTCATGCATCACAGAAAAGCAACTACTTATATTTC 2280 

2281 AGAGCAAATTAGCAGATTAAATAGTGGTCTTAAAACTCCATATGTTAATGATTAGATGGT 2340 

2341 TATATTACAATCATTTTATATTTTTTTACATGATTAACATTCACTTATGGATTCATGATG 2400 

2401 GCTGTATAAAGTGAATTTGAAATTTCAATGGTTTACTGTCATTGTGTTTAAATCTCAACG 2460 

2461 TTCCATTATTTTAATACTTGCAAAAACATTACTAAGTATACCAAAATAATTGACTCTATT 2520 

2521 ATCTGAAATGAAGAATAAACTGATGCTATCTCAACAATAACTGTTACTTTTATTTTATAA 2580 

2581 TTTGATAATGAATATATTTCTGCATTTATTTACTTCTGTTTTGTAAATTGGGATTTTGTT 2640 

2641 AATCAAATTTATTGTACTATGACTAAATGAAATTATTTCTTACATCTAATTTGTAGAAAC 2700 

2701 AGTATAAGTTATATTAAAGTGTTTTCACATTTTTTTGAAAGAC 2743

FIG.5d 
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I mLQMYVYIYLFMLIAAGPVOLNEGSEREENVEKEGLCNACAWRQNTR 50 

mil iiiiiiiii iiiiiiii II iiiiiiiiiiii mil 

. I MQKLQLCVYIYLFMLIVAGPVOLNENSEQKENVEKEGLCNACTWRONTK 49 
51 YSRIEAIKIOILSKLRLETAPNISKDAIRQLLPRAPPLRELIDQYOVQRD 100 

iiiiiiiiiiiiiiiiiiiiiiiii mm iiiiiiiiiimiii 

50 SSRIEAIKIQILSKLRLETAPNISKDVIRQLLPKAPPLRELIDQYOVQRD 99 

101 OSSOGSLEOOOYHATTETIIT^PTESOFLMQADGKPKCCFFKFSSKIQYN 150 

llllllllilllllilllllillllllllll llllllllllllllllll 
100 DSSDGSLEDDDYHATTETIimESDFLMOVDGKPKCCFFKFSSKIQYN 149 

151 KWKAQLWIYLRPVKTPTTVFVQILRLIKPMKDGTRYTGIRSLKLDMSPG 200 

llllllllilllll llllllllllllllllllllllllllilim II 
150 KWKAQLWIYLRPVETPTTVFVQILRLIKPMKOGTRYTGIRSLKLDMNPG 199 

201 TGTOIOVKTVLQNWLKQPESNLGIEIKALDENGHDLAVTFPGPGEDGL 250 

llllllllillllllllllllllllllllllllllllllllllllillll 
200 TGIWQSIDVKTVLQNWLKQPESNLGIEIKAIDENGHDLAVTFPGPGEOGL 249 

251 NPFLEVKVTDTPKRSRROFGLDCDEHSTESRCCRYPLTVDFEAFGWDWII 300 

lllllllllllllllllllllllllllllllllllillllllllllim 
250 NPFLEVKVTOTPKRSRRDFGLDCDEHSTESRCCRYPLTVDFEAFGWDWII 299 

301 APKRYKANYCSGECEFVFLQKYPHTHLVHQANPRGSAGPCCTPTKMSPIN 350 

llllllllllllllllllllllllllllllllllllimilllilllll 
300 APKRYKANYCSGECEFVFLQKYPHTHLVHQANPRGSAGPCCTPTKMSPIN 349 

351 MLYFNGKEQIIYGKIPAMWDRCGCS 376 

llllllllllllllilllllllllll 
350 MLYFNGKEQIIYGKIPAMWORCGCS 375 
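
The alignment above marks identical positions between the two sequences with vertical bars. The Python sketch below shows how such a match line and a percent identity can be computed for one aligned row. The two strings are transcribed from the row numbered 51-100 (upper) and 50-99 (lower), with apparent scanning misreads of Q and D corrected against the sequence listing, which is an assumption; the function names are illustrative only.

```python
def match_line(upper, lower):
    """Return '|' where two aligned sequences agree and ' ' where they differ."""
    return "".join("|" if a == b else " " for a, b in zip(upper, lower))


def percent_identity(upper, lower):
    """Percent of aligned positions carrying the same residue."""
    if len(upper) != len(lower):
        raise ValueError("aligned rows must have the same length")
    matches = sum(a == b for a, b in zip(upper, lower))
    return 100.0 * matches / len(upper)


# One 50-residue row of the alignment (transcription assumed, see above).
UPPER = "YSRIEAIKIQILSKLRLETAPNISKDAIRQLLPRAPPLRELIDQYDVQRD"
LOWER = "SSRIEAIKIQILSKLRLETAPNISKDVIRQLLPKAPPLRELIDQYDVQRD"

print(UPPER)
print(match_line(UPPER, LOWER))
print(LOWER)
print(percent_identity(UPPER, LOWER))
```

For this row the two sequences differ at three of fifty positions.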
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FrCtTRE 1^ 



L/l ^ 31/11 

ATc CM A.\.\ era cx\ ctc tct err tat att tac ctc ttt atg ctg att ctt cct cgt cca 

HQ K *- C ^ C ■/ Y I Y L r M L I V* A * C 
61/:i 91/31 

ctc CAT C7A .--^r CAC AAC ACT CAC CAA AAA CAA AAT CTG CAA AAA GAG CCG CTG TCT AAT 

V D C N H N' S H Q K E N V E K E C L* C M 
121/41 151/51 

CCA TCT ACT TCC ACA C.\A A.\C ACT AAA TCT TCA AGA ATA CAA CCC ATT A.\A ATA CAA ATC 
ACT-aCN T K SSR I £ A I K I Q I" 
131/al 211/71 

CTC ACT A-U CTT CCT CTC CAA ACA CCT CCT AAC ATC AGC AAA CAT CCT ATA ACA C.\A CTT 
L SX LRLE T A ? N I S K DA I RQ*L 
241/31 271/91 

TTA CCC AAA CCT CCT CCA CTC CCG CAA CTG ATT CAT CAC TAT CAT CTC OiG AGG CAT CAC 

L ? K A ? ? L ?. S L I D Q r D V Q R 0 D 
301/101 331/111 

AGC ACC CAT CCC TCT TTG CAA CAT CAC CAT TAT CAC CCT ACA ACG CAA ACA ATC ATT ACC 

SSDCSLEDDDYHATTETIIT 
351/::i 391/131 

ATC CCT ACA GAG TCT CAT TTT TTA ATC CAA CTG CAT CCA AAA CCC AAA TGT TGC TTC TTT 
MPTESDF LM-QVOCK?KCCFr*' 
421/141 451/151 

AAA TTT ACC TCT AAA ATA CAA TAC AAT AAA CTG CTA AAC CCC CAA CTA TGG ATA TAT TTG 
a.?ssx:q Y NKVVKAQ LW IVL 
431/151 £11/171 

ACA CCC CTC CAC ACT CCT ACA ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AA.^ CCT ATG 
?• r V £ T ? T T V F V Q I L R L : K ? M 
=41/131 571/191 

A^A CAC CCT ACA AGG TAT ACT CCA ATC CCA TCT CTG AAA CTT CAC ATG AAC CZX CGC ACT 
aC.CTRYTG IRS LXLDMNPGT 
SQl/201 631/211 

Cr: ATT TGG CAG AGC ATT CAT CTG AAG ACA GTG TTG CAA AAT TGG CTC AAA C\A CCT GAA 
CIWQSIDV JCTVLQNWL KQPE 
651/221 691/231 

TCC AAC TTA CGC ATT CAA ATA AAA CCT TTA CAT CAC AAT CGT C\T GAT CTT CCT GTA ACC 
SJ^LCrsIKALDSNCKCLAVT 
721/241 731/251 

TTC CCA CGA CCA CGA CAA GAT CGC CTG AAT CCC TTT TTA CAC CTC AAC CTA ACA CAC ACA 
rPGPGEDC LNPFLSVKVTD T 
731/261 ail/271 

CCA AA.\ A^ TCC ACA AGG CAT TTT CGT CTT CAC TCT CAT CAC CAC TCA AC\ GAA TCG CCA 
PKRSRRD ? CLDCDEKSTESR 
841/281 871/291 

TGC TGT CGT TAC CCT CTA ACT CTC CAT TTT CAA CCT CTT CGA TGG GAT TCG ATT ATC CCT 
CC RY P LTV D FEALGWDW r t A 
901/301 931/311 

CCT AAA ACA TAT AAG CCC A.\T TAC TCC TCT CGA CAG TGT CAA TTT CTA TTT TTA CAA AAA 
PKR .YKANYCSCECEFVFLQK 
961/321 991/331 

TAT CCT CAT ACT . CAT CTG CTA CAC CAA CCA AAC CCC ACA CGT TCA CCA CGC CCT TCC TGT 

VPKTHLVHQANPRCSAGPCC 
1021/341 1051/351 

ACT CCC ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT CGC AAA CAA C^A ATA ATA 
TP -TKMSP I NMLYFMCKEOrT 
1031/361 llU/371 

TAT CGC AAA ATT CCA CCC ATC CTA GTA CAC CCC TGC CGC TGC TCA TCA 

V GKIPAMVVDRCCCS* 
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i/i ai/u 

A7C CU .^'A CTG C.U ATC TCT GTT TAT ATT TAC CTA TTT ATG C7C ATT CTT CCT CC^ r^iv 
M C K u C I S V Y r Y L F M L I V A C " 
01/21 SU2i 

G7G GAT CTG .UT GAG .^w^^C ACC GAG CAG AAG GAA AAT CTG CA.\ AAA GAG CGG CTG TCT AAT 

VD'-^•H^^SEQKE^^VEKECLCN 
m/n LSi/si 

GC\ TG7 ..G TGG AGG CAA PJKC ACT ACA TCC TCA ACA CTA CkA CCC ATA AAA A^T C\A ATC 
ACLWRE NTTSSHLEAIKIQ*! 
13L/61 2L1/71 

CTG ACT AA.!. CTT C::C CTG CAA ACA CCT CCT AAC ATC AGC AAA GAT GCT ATC ACA CVA CTT 

^.^:<L^'-ETAPNrSKDAlRn'L* 
241/31 271/91 

TTC CCC A.^ GCT CCT CCA CTC CTG GAA CTG ATT CAT CAG TTC CAT CTC CAG ACA C\-^ GCC 
t-PKA?? L L E Lr OQTDVQKD^A 
301/101 331/111 

AGC ACT CAC GGC TCC TTG CAA CAC GAT GAC TAC CXC CCC AGG ACG GAA ACC CTC A-^T A^C 
SSDGSLED DDYKARTETVTT* 
3ol/l2l 391/131 

ATG CCC ACG CAG TCT CAT CTT CTA ACC CAA CTC CAA CGA AAA CCC AAA TCT TCC TTC TTT 
H?TESO L L TQVSGKPKc'cf'f 
421/141 451/151

AAA TTT ACC TCT AAG ATA CAA TAC AAT AAA CTA CTA AAG CCC C\A CTG TGG A-^^ T^*^ CTG 
:<?SS5: Q Y NXLVKAQUWiY*r 

481/161 511/171

A::G CCT C7C AA.3 act C_. CCG A.CA CTG TTT CTG CAA A.TC CTG ACA CTC ATC AA-A CCC ATG 
r V >: 7 ? A T V f V C I 1, H L I :< a"* m'" 
541/181 571/191

AAA CAC CG7 ACA ACG TAT ACT CGA ATC CCA TCT CTG AAA CTT CAC ATG AAC CZX CGC \CT 
XOCTRYTC IP. SLKLDM NPCT 
601/201 631/211 

CGT A.* TGG C\G ACC ATT CAT CTG AAC ACA CTC TTG C\C AAC TGG CTC AAA OA CCT CAA 
C^'^'CSC D V KTVL QNW LKQ ?"e 
661/221 691/231

TCC AAC TTA CGC ATT GAA ATC AAA GCT TTA CAT CAG AAT CGC CAT GAT CTT CCT CTA ACC 
^NLClEt KALDENCHOLAVT 
721/241 751/251

...C CCA CAA CCA CGA CAA CAT CGA CTC ACT CCT TTT TTA CA.A CTC AAG CTA AC\ GAC AC\ 
^P-^GEDC l'tPFLEVKVTDT 
781/261 811/271

CCA AAA AGA TCT AGG AGA CAT TTT CGG CTT CAT TCT GAT CAA CAC TCC ACA GAA TCT CCA 
PKRSRROF CLDCD EHSTESR 
841/281 871/291

TCC TCT CCr TAC CCT CTA ACT CTC CAT TTT CAA CCT TTT CGA TGG GAT TGG ATT ATT CCA 

^C^ YPLTVDFEA FCWDWIIA 
901/301 931/311

CCT AAA AGA TAT AAG GCC AAT TAC TCC TCT CCA CAA TCT CAA TTT CTA TTT TTG CAA AAC 

PKRYKANYCSCECEFVFLQK 
961/321 991/331

TAT CCT CAT ACC CAT CTT CTG CAC CAA CCA AAC CCC ACA CCT TCA CCC CGC CCC TCC TCT 

^PKTKLVHQANPRGSACPC C 
1021/341 1051/351

ACT CCT ACA AAC ATC TCT CCA ATT AAT ATC CTA TAT TTT AAT CGC CAA CGA CAA ATA AT\ 
^ ^ H S P I NMLYFNCEGQT t" 

1081/361 1111/371

TAC CCG AAC ATT CCA CCC ATC CTA CTA CAT CCC TCT CCC TCT TCA TCA 
Y<3KIPAMVVDRCCCS- 
FIGURE He



1/1 31/11

ATC C.^^ .V^C C7A CCA CTC TAT CTT TAT ATT TAC CTG TTC ATC CAC ATC CCC CTT CAT CCG 
yi^ Q L .\ ^' r V r r Y L F M Q I A V D P 

61/21 91/31

CTG CCT CTG CAT CCC ACT ACT GAG CCC ACA CAC AAC CCT CAA AAA CAC CGA CTG TGC AAT 

L Z Z S S Q PTEMAEKDCLCN 
121/41 151/51

CCT TGT ACC TGC ACA CAC AAT ACA AAA TCC TCC ACA ATA CAA CCC ATA AA.\ ATT CAA ATC 

a"*' c ' T y ?. C ^^ T K s s a i e a i k r q i 
181/61 211/71

CTC ACC CTG CCC CTG CAA CAA CCA CCT AAC ATT ACC ACG CAC CTT ATT A\G CAG CTT 

LsKLRLEQAPNlSaOVIKQL 
241/81 271/91

T^\ CCC AAA CCT CCT CCA CTC CAC CAA CTC ATT GAT CAG TAT CAT CTC CAC ACC CAC CAC 
l"pKA??LQ ELI DQYDVQROD 
301/101 331/111

ACT ACC CAT CGC TCT TTC CAA CAC CA.T CAC TAT CAT CCC AC\ ACC GAG ACG ATT ATC ACA 
5SDCSLED ODYHATTETIIT 
361/121 391/131

ATG CCT ACG CAG TCT CAT TTT CTT CTA CAA ATG CAG CGA AAA CCA AAA TGT TGC TTC TTT 
K*PT£SOcLVQMECKPKCCcF 
421/141 451/151

AAC TTT AGC TCT AAA ATA CAA TAT AAC AAA CTA CTA AAG CC\ CAA TTA TCC ATA TAC TTC 

xf'sski Q Y nkvvkaqlw IY L 

481/161 511/171

AGG C.-A CTC C---. AAA CCT ACA A-TG CTG ... u.G CAC ATT CTG ACA CTC A,. AVG CTC A.TG 

C V C K r T T V =■ V Q I I. ?. 1 : K P H 
541/181 571/191

lAA. CAC GGT ACA AGA TAT ACT CGA ATT CGA TCT TTG A^A CTT CAC ATC AAC CCA CGC ACT 
3CDCTRYTCIRSLKLDMNPCT 
601/201 631/211

CCT ATC TGG C\G ACT ATT CAT CTG AAG ACA CTC CTG CAA AAT TGC CTC AAA CXG CCT CAA 
crWQSIDVKTVLQNWLKQPE 
661/221 691/231

TCC AAT TTA CGC ATC CAA ATA AAA CCT TTT CAT GAG ACT CGA CCA CAT CTT CCT CTC ACA 
SNLGIEI KAFOETCROLAVT 
721/241 751/251

TTC CZX CG.-. CCC CCT CAA. CAT CGA TTG AAC CCA TTT TTA CAC CTC ACA CTT AC^ CAC ACA 
T3CPCZ0C LNPFLEVRVTDT 
781/261 811/271

CCG AAA CCG TCC CGC AGA CAT TTT CGC CTT CAC TGT CAT CAC CA" TCA ACC CAA TCC CCA 
P3CRSRRDFGLDCDEKSTESR 
841/281 871/291 

TGT TGT CGC TAC CCG CTG ACA CTC CAT TTC CAA CCT TTT CCA TGG CAC TCC ATT ATA CCA 
C'C RY*? I T V -D F.EA F CW D W I I A 
901/301 931/311 

CCT AAA AGA TAC AAA CCC AAT TAC TGC TCC CGA CAA TGC CAA TTT CTC TTT CTA C\G AAA 
PKRYKAKYCSCECE FVFLQK 
961/321 991/331 

TAC CCG CAC ACT CXC CTG CTA CAC CAA CCA AAT CCC ACA CGC TCA CCA CGC CCT TCC TGC 
YPHTKLVKQANPRCSAC PCC 
1021/341 1051/351 

ACA CCC ACC AAG ATG TCC CCT ATA AAC ATC CTC TAT TTC AAT CGA AAA CAA CAA ATA ATA 

TPTK 'ksp I NMLYFMCKEQI I 

1081/361 1111/371

TAT CGA AAC ATA CCA CCC ATC CTT CTA CAT CCT TGC CCC TCC TCA TCA 

YCKrPAMVVORCCCS* 
1/1 31/11

KTQ XT7 C.U A.*^^ CZZ C.\A ATC TAT CTT TAT ATT TAC CTC TTT CTG CTG ATT OCT CCT CCC 
HIQKPQM rvviVLFVLlAAG 
61/21 91/31

CCA CTC CAT CTA .;.\7 CAC CAC ACT GAG ACA GAG CCC AAT CTC CA.\ AAA GAG CCC CTG TGT 
PVDLNID S EREANVEKHGLC 
121/41 151/51 

AAT CCC TOT CZZ TCC ACA CAA AAC AC\ ACC TAC TCC ACA ATA CAA CCC ATA AA.\ ATT d-A 
>I A C A W R Q M T ?. Y S R r £ A r K r Q 

181/61 211/71

ATC CTC ACT A-^uV CT^ CGC CTG CAA ACA GCG CCT A.\C ATC ACC AAA GAT CCT ATA ACA CAA 
I^LSKL^L ETAPNISKDAIRQ 
241/81 271/91

CTT CTG CZC ACA CCG CCT CCA CTC CGC GAA CTG ATC GAT CAC TAC CAC CTC CAC ACC GAT 
LLPaA? ? LRELXD QYOVQRD 
301/101 331/111

CAC ACC ACT CAC CCC TCT TTG CAA GAT CAC GAT TAT CAC CCT hCZ ACC CAA ACA ATC ATT 
DSSCCSLEODDYKATTETrr 
361/121 391/131

ACC ATC CC? AC- CAC TCT GAC TTT CTA ATG CAA CCC CAT CCA AAC CCC AAA TCT TCC TTT 
THPTESD ? LMQADCKPKCCr 
421/141 451/151

TTT AA.\ TTT ACC TCT AAA ATA CAC TAC AAC AAA CTG CTA AAC CCC GAG CTG TCC ATA TAT 
FK'rS^SiCI QYNKVVKAQLWIY 
481/161 511/171

CTG ACA CCC CTC AAC ACT CCT ACA AC\ CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCC 

u a A V 7 7 3 T T V ? V c : L ?. L I X ? 
541/181 571/191

ATG A-iA CAC CCT ACA ACC TAT ACC CCA ATC CCA TCT CTC AAA CTT GAC ATC AC-C CCA CCC 
MXDCTRY T G laSLX LDMS ?G 
601/201 631/211

ACT GCT ATT TCC CXu ACT ATT CAT CTC AAC ACA CTC TTG CAA AAT TCC CTC AAA CAC CCT 
TGIWQSI DVKTV.LQKWLECQP 
661/221 691/231

CAA TCC AA.C TTA CCC ATT GAA ATC AAA CCT TTG CAT GAC A.!iT CCC CAT CAT CTT CCT CTA 
iSK LGIE I KAt-DEMCHDLAV 
721/241 751/251

ACC TTC CCA CCA CCA CCA CAA CAT CCC CTC AAT CCC TTT TTA CAA CTC AAA CTA ACA CAC 
T??C?CS D CLKPFLEVKVTD 
781/261 811/271

AC\ CCC AAC ACC TCC CCC ACA GAC TTT CCC CTT CAC TCT CAT GAA CAC TCC ACC CAA TCC 
TPKRSRRD FCLDCDEHSTSS 
841/281 871/291

CCG TCC TCT CCC TAC CCC CTC ACC CTC CAT TTC CAA CCC TTT CCA TCC CAC TCC ATT ATT 
acCRYPLTVDFE AFCWDWir 
901/301 931/311

CCA CCC AAA ACA TAT AAC CCT AAT TAC TCC TCT CCA CAC TCT GAA TTT CTC TTC TTA CAA 
APKRTJCA NYCSGECEFVFLQ 
961/321 991/331

AAA TAT CCC CAT ACT CAT CTT CTG CAC C\A CCA AAC CCC ACA GCC TCC CCA CGC CCT TCC 
3:r?HTKLVKQAK??. CSACPC 
1021/341 1051/351 

TCC ACC CCA ACA AAA ATC TCT CCC ATT AAT ATC CTA TAT TTT AAT GCC A.\A CAA CAA ATA 
CTPTXMS P IKMLYFNCKEQI 
1081/361 1111/371

ATA TAT CCC AAA ATT CCA GCC ATC CTA CTA CAC CCC TCT CCC TCC TCC TCA 
lYCKlPAKVVORCGCS- 
aTC C.\A AAC C7A CCA CTT TAT CTT TAT ATT TAC CTC TTC ATG CAC ATT TTA CTT CAT CCG 
rt'aKLXVYVr XYLFKQILVKP 
61/21 91/31

CTC CCT CTT CAT CCC A^T ACT CAC CCC AC\ CAC AAC CCT CAA AAX CAC CCK CTG TCC AAT 

v'aldcssq ptewaekdclcm 

121/41 151/51

ACC T=w ACA CXC AAT ACT AAA TCC TCC ACA ATA CAA CCC ATA KK\ ATT C\A ATC 

;^^*C 'r"'^'*^ ^ ^ ^ ^ * EAIK IQ I 
181/61 211/71

AAA CT- CCC CTC CAA CAA CCA CCT A.\C ATT AGC AGC CAC CTT ATT AAA CAA CTT 

r'^S^Kc'^.Li Q A PK I S Ri3 V- I KQ L 
241/81 271/91

— \ c-c a-vx cct cct c=c ctc cac caa ctg att cat cxg tat cac ctc cac aca cac cac 

^"p""ka??i- Q z l z ccvdvcko D 

301/101 331/111

ACT AGC CAT CCC TCT TTC CAA CAC GAT CAC TAT CAT CCC AC\ ACC CAA AC3 ATT ATC ACA 
5*5DGSLHD DDYKATTETIIT 
361/121 391/131

A-C CCT ACC CAC TCT CAT TTT CTT CTA CV\ ATC CAC CCA AAA CCA AAA TCT TGC TTC TTT 
wpTHSr? LVQ*IS GK?KCC?5 
421/141 451/151

XAG *^ ACC TCT A.M ATA CAA TAT AAC AAA CTA CTA AAG CCA CAA TTA TGC ATA TAC TTC 
xis5X:QY^^XVVKAQCWIYL 

481/161 511/171

ACC OA CTC CAA AAA CTT AC\ ACC CTC TTT CTC CAC ATC CTC ACA CTC A,- AAA CCC ATC 
r r' Q .^ ? T T V ? V Q I L X : t ? K 

541/181 571/191

AAA CAC CCT ACA AGA TAT ACT CCA ATT -CCA TCT TTC AAA CTT a\C ATC AAC CCA CCC ACT 
3CDCT?. rTC IRSLXLOMNPGT 
601/201 631/211

CCT ATC TGC CVC ACT ATT CAT CTC AAC ACA CTC r?3 CAA AAT TCC CTC AAA CAC CCT CAA 
crWQSrDVKTVL QKW LKQ? 2 
661/221 691/231

T-t AAT TTA CCC ATC CAA ATA AAA CCT TTT CAT CAC K\7 CCA CCA CAT CTT CCT CTA AC\ 
SKLClZtKAFDENCRDLAVT 

721/241 751/251

•^C CCA CCA CCA CCT CAA CAT CCA CTC AAC CCA TTT TTA CAC CTC ACA CTT ACA CAC ACA 
T?c? C'£DC Lr:?FLEVXVTCT 
781/261 811/271

CC*" .\AA CCC TCC CCC ACA a=vT TTT GCC CTT GAT TCC CAC CAC CAC TCA ACC CAA TCT CCA 
? K RS r, .^CF CLDCDEKSTES?. 
841/281 871/291

TCT TCT CCC TAC CCC CTC ACA CTG CAT TTT CAA CCT TTT CCA TCC CAC TGC ATT ATA CCA 
CCAY?LTVD fEA? cWOWIIA 
901/301 931/311

CCT AAA ACA TAC AAA CCC AAT TAC TCC TCT CCA CAA TCT CAA TTC CTA TTT CTA CAC AAA 
PKRY^ANrCSGECEFVFLGK 
961/321 991/331

TAC CCC CAC ACT CAC CTC CTA CAC CAA CCA AAT CCA ACA CGC TCA CCA CCC CCT TCC TCC 
./ 7 H T K L V H Q A K ? R C S A G ? C C 

1021/341 1051/351

ACA CCC ACC AAG ATG TCC CCT ATA AAC ATC CTC TAT TTC AAT CCA AAA CAA CAA ATA ATA 
TPTKKSP I UHUrTt^OZtQl I 
1081/361 1111/371

TAT CCA AAC ATA CCA CCC ATC CTT CTA CAT CCT TCC CCC TCC TCA TCA 
TCXIFAKVVDBCCCS- 
ATC CAA AAA CTC CX\ ATC TAT / TAT ATT TAC CTC TTT ATC CTG ATT CTT CCT CC-i* CCC 
MQKLClVVYlYLFMtlVAGP 
61/21 91/31 

CTG CAT CTG A.\7 GAG AAC AGC CAC CAA AAG CAA AAT CTG CAA AAA GAG GGG CTG TGT AAT 
VDL^^H^'S E qkenveke glc'n 
121/41 151/51

CCA TGT ATG TGG AGA C,\A AAC ACT AAA TCT TCA AGA CTA GAA GCC ATA AAA ATT CAA ATC 
ACMWRCNT KSSRLEAIKIQI 
181/61 211/71

CTC ACT AAA CTT C5C CTG GAA ACA CCT CCT AAC ATT AGC AAA GAT CCT ATA AGA CAA CTT 
LSKL^'-ET A PWrSKDAlRQL 
241/81 271/91

TTG CCC AA.\ GCT CCT CCA CTC CCC GAA CTG ATT GAT GAG TAC GAT GTC CAG AGA GAT GAC 
LPKArPLR ELrDQYDVQRDD 
301/101 331/111 

AGC ACT GAT GGC TCC CTG GAA GAT GAT CAT TAT CAC CCT ACG ACG GAA ACG ATC ATT ACC 
5SDGS-ED DDYHATTETIIT 
361/121 391/131

ATG CCT ACA GAG CCT GAT CTT CTA ATC CAA GTG GAA GGA AAJl CCC AAA TGC TGC TTC TTT ' 
M?TH5DLLMQVEGK?KCC?F 
421/141 451/151

AAA TTT AGC CCT r.U ACA CAA TAC AAT AAA CTA GTA AAG GCC C\A CTG TGG ATA TAT CTC 
KFSSKIQYNKVVKAQL W IYL 
481/161 511/171

AGA CCC GTC A.\G ACT CCT ACA ACA CTC TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC ATG 
Rpv:<T?TTVFVQlLRLrX?M 
541/181 571/191 

AAA GAC GGT AC\ AGG TAT ACT GGA ATC CGA TCT CTG AAA CTT GAC ATG AAC CCA GGC ACT 
KDG TRYTG IRSLKLDMNPGT 
601/201 631/211 

ggt att tgg cac agc att gat gtc aag aca gtg ttg caa aat tgg ctc aaa caa cct gaa 
giwqs:dv.ktvlqnwlkc?e 

661/221 691/231

TCC AAC TTA GGC ATT G.-A ATC AAA. CCT TTA GAT GAC AA.T GGT CAT GAT CTT CCT CTA ACC 
S N' L C : E r K A L D £ N G H D L A V T 
721/241 751/251 

TTC CCA GGA. CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA CAA GTC AAG GTA ACA GAC ACA 
rPCPCEDG LNPFLEVKVTDT 
781/261 811/271

CCA AAA. AGA TCC AGG .A-GA GAT TTT CCA CTC G.AC TGT CAT GAG CAC TCA ACA GAA TCT CCA 
rERS?. RDr G LDCDEKSTESR 
841/281 871/291

TCC TGT CGT TAC CCT CTA ACT GTG CAT TTT GAA CCT TTT GGA TGG GAC TGG ATT ATT GCA 
CCR-V ?LTVDFEArCWDWIIA 
901/301 931/311

CCC AAA AGA TAT AAG GCC AA.T TAC TGC TCT GGA GAJ3 TGT CAA TTT CTA TTT TTA CAA .AAA 
PKRYKANYCSGECErVFLQK 
961/321 991/331 

TAC CCT CAC ACT CAT CTT GTG CAC CAA GCA AAC CCC AGA GGT TCA GCA GGC CCC TGC TGT 
YrHTHLVKQANPRGSAGPCC 
1021/341 1051/351

ACT CCC ACA AAG ATG- TCT CCA ATC AAT ATG CTA TAT TTT AAT GGC AAA GAA CAA ATA ATA 

TP TK'MSPI NMLYFNGKEQII 

1081/361 1111/371 

TAT GGG AAA ATT CCA CCC ATG GTA GTA GAT CGC TGT GGG TGC TCA TCA 

YGKIPAMVVDRCGCS* 
1/1 31/11

A-G CAA A.\A era C.\.\ ATC TTT CTT TAT ATT TAC CTA TTT ATC CTG CTT CTT CCT CGC CCA 

m'qklcifvtiylfmluvacp 

61/21 91/31 

CTG CA-^ CTC .UT CAC A^C AGC GAG CAG AAG GAA AAT CTG CAA AAA AAG GGG CTG TCT AAi 
v' p ' r N' H S S E Q K E M V E K K C L C N 
121/41 151/51

CCA TCC TTC TOG AdA C.-A A.\C A.AT AAA TCC TCA AGA CTA GAA CCC ATA AA-\ ATC CAA ATC 
3,"' C L W R C » N K S S n L E A I K I Q I 
181/61 211/71

c-c aot aag ctt ccc ctc gaa aca cct cct aac atc agc aaa cat cct ata aga c.\a c:i 
l'sklkletapniskdairql 

241/81 271/91

C"- a=.G CCT CCT CCA CTC CGG GAA CTG ATT GAT CAG TAC GAT GTC CAG AGA GAT GAC 
L*"d^KA? r L H E L IDQYDVQ RD D 
301/101 331/111 

AGC AGC G-C CCC TCC TTC CA_A GAC GAT GAC TAC CAC GTT ACG ACG GAA ACG GTC ATT ACC 
SSDGSLED DDYKVTTETVIT 
361/121 391/131

A-G CCC ACG GAG TCT CAT CTT CTA CCA GAA GTC CAA GAA AAA CCC A.\A TGT TCC TTC TTT 
y pT.c-SDLL AEVQEK?KCC?F 
421/141 451/151

iii T-^ AGC TCT AAG ATA C\.A CAC AAT AAA GTA GTA AAG CCC CAi^. CTG TGG ATA TAT CTG 
k"' r' S S K I G K H K V V K A Q L W I Y L 
481/161 511/171

AGA CCT GTC AAG ACT CCT ACA ACA CTG TTT GTC CAA ATC CTG ACA CTC ATC AAA CCC ATG 
j^:3VK"'?TTVFVQrLRt4lX?M 

541/181 571/191

iiA CAC CCT ACA AGG TAT ACT GGA ?"C CCA TCT CTG AAA CTT GAC A.G A.-.W —.1. 

G T ?. Y T G I K S L K L G M N ? G T 
601/201 631/211

CG^ A-^ TGG CAC AGC ATT GAT CTG AAG ACA CTC TTG CXk AAC TGG CTC AAA CA.A CCT GAA 
c"-*WQS : D V KTVLQ NW L KQ ? E 
661/221 691/231

--C AAC TTA CGC ATT GAA ATC AAA CCT TTA GAT GAG AAT GGT CAT GAT CTT CCT GTA ACw 
SMLG IHIKALDENGKDLAVT 

721/241 751/251

T^C CCA GA.A CCA GGA GAA GAA GGA CTG AAT CCT TTT TTA GAA GTC AAG CTA ACA GAC ACA 
c'pEPGEEG LNPFLEVKVTDT 
781/261 811/271

CZX >,K\ ACA TCT AGG AGA GAT TTT GGG CTT GAT TGT CAT GAG CAC TCC ACA C;^.A TCT CCA 
^>E:RS?. RDrCLDCDEHSTESR 

841/281 871/291

TCC TGT CGT TAC CCT CTA ACT GTC CAT TTT GAA CCT TTT GGA TGG GAT TGG ATT ATT GCA 
CC RYP LTV D F EA FCWDW I I A 
901/301 931/311

CCT AAA AGA TAT AAG CCC AAT TAC TCC TCT GGA CAA TCT CAA TTT TTA TTT TTG CAA AAG 

^•/ry-kanycsgeceflflqk 

961/321 991/331

TAT CCT CAT ACC CAT CTT CTG CAC CAA GCA AAC CCC AAA GGT TCA CCC GGC CCT TGC TCT 
VOHTKLVKQANPKGSACPCC 

1021/341 1051/351 

p^j^^ ^r^^ CCA ATT AAT ATG CTA TAT TTT AAT CGC AAA GAA C^A ATA ATA 

t" o T K y. S ? I N K L Y F N G K E <2 I I 
1081/361 1111/371

TAT GGG AAG ATT CCA CGC ATG GTA GTA GAT CGC TGT GGG TCC TCA TCA 
YGXIPCMVVDRCCCS* 